terminal-csv-splitter
TypeScript icon, indicating that this package has built-in type declarations

2.0.1 • Public • Published

terminal-csv-splitter

A CSV splitter that uses inbuilt terminal commands.

oclif Version Downloads/week License

Introduction

This CLI splits files using split, sed and mv. It will only work on systems that have these binaries installed and accessible. By default, it creates a new folder in your current position with the name /split-files and stores the result there.

Performance

I ran this over a file with over 35 million lines (35584137 to be exact) and I set lines to be 500000 per file. The process took a total of 22.297s to complete. Here are a few examples:

| Rows       | Time (s) |
|------------|----------|
| 35,584,137 | 22.297   |
| 33,660,564 | 20.327   |
| 1,858,407  | 0.348    |

In most cases, other CLIs that I used were not able to run on files of this size, so I don't have a benchmark against others (yet).

Installation

You can install this CLI globally using:

npm install -g terminal-csv-splitter

Or you can use npx to run it without installing it.

Binaries

Binaries are included on the releases page.

Usage

The purpose of this package is to make it easy to split files into multiple ones (with or without headers). The file name and number of lines are required, the other flags can be left out. Some examples of usage include:

terminal-csv-splitter ./my_file.csv --lines=500000
terminal-csv-splitter ./my_file.csv --lines=500000 --out=./custom/output/folder
terminal-csv-splitter ./my_file.csv --lines=500000 --no-header

Using npx would look as follows:

npx terminal-csv-splitter ./my_file.csv --lines=500000

Running terminal-csv-splitter --help shows you the flags and their defaults:

Splits a file into multiple CSVs. You are required to have split, sed and mv installed for this to work!

USAGE
  $ terminal-csv-splitter [FILE] --lines=lines

OPTIONS
  -H, --no-header    If there is no header in the csv then set this to true
  -h, --help         show CLI help
  -l, --lines=lines  (required) The number of lines per file
  -o, --out=out      [default: ./split-files] The output directory
  -v, --version      show CLI version

EXAMPLES
  terminal-csv-splitter ./my_file.csv --lines=500000
  terminal-csv-splitter ./my_file.csv --lines=500000 --out=./custom/output/folder
  terminal-csv-splitter ./my_file.csv --lines=500000 --no-header

Package Sidebar

Install

npm i terminal-csv-splitter

Weekly Downloads

1

Version

2.0.1

License

MIT

Unpacked Size

23.6 kB

Total Files

13

Last publish

Collaborators

  • entrostat-developer