HN.zip

Show HN: GS-Calc – A modern spreadsheet with Python integration

Process large (e.g. 4GB+) data sets in a spreadsheet.Load GB/32 million-row files in seconds and use them without any crashes using up to about 500GB RAM.Load/edit in-place/split/merge/clean CSV/text files with up to 32 million rows and 1 million columns.Use your Python functions as UDF formulas that can return to GS-Calc images and entire CSV files.Use a set of statistical pivot data functions.Solver functions virtually without limits for the number of variables.Create and display all popular chart types with millions of data points instantly.Suggestions for improvements are welcome (and often implemented quite quickly).

84 points by jpiech - 14 comments

14 Comments

nemoniac [3 hidden]5 mins ago
sunray2 [3 hidden]5 mins ago
So I'll take a layman's view here since I've only cursory experience of the big data tasks that this software seems to made for. Or maybe the pitch is still different and it went over my head.

It loads quick, and works with large data. Crucially, you can view and edit visually, not only programmatically.

Assuming those already working with such data have Excel and Python tools etc., the pitch here is that the $39 license fee saves time or effort. So, is it that the user can spot and correct errors that you couldn't otherwise do with either Excel or with other big data tools? And/or otherwise do the necessary data manipulations?

I came across the phrase 'eyes like a shithouse rat' recently, to describe the people doing final checks at a printing press. I think there's probably plenty of people out there who would pay $39 for eyes like a shithouse rat.

Also the website makes me nostalgic :)

tianqi [3 hidden]5 mins ago
It is an interesting tool. I've been struggling with Office Excel's inability to open large files. I always work with csv in python, and if a client must review the data in Excel, I take a random sample to generate a smaller file, then explain to the client that we can't open the whole in Excel. This really doesn't seem like a modern work.

"a slow, old pc with 8GB RAM"

By the way, this struck me like a humour of era. Oh god it has 8GB RAM. Cheers! To the good old days.

samzub [3 hidden]5 mins ago
Fully agree, I have found that above 300k rows Excel struggles even on a good laptop. Not even mentioning the Python integration into MS Excel that is so unbearably slow that it is much better performing the calculations outside of Excel first.

I am sold on the website looks and license model!

microflash [3 hidden]5 mins ago
These days I use DuckDB to read massive excel files. DuckDB now ships with a nice local UI and it also works beautifully with Datagrip, my preferred database IDE. With SQL, it just becomes a matter of applying old grease to do whatever analysis I want.
arvindh-manian [3 hidden]5 mins ago
I've been using Tad [0] for this purpose, as it streams in the data.

[0] https://www.tadviewer.com

conductr [3 hidden]5 mins ago
It all sounds very compelling , great work! But I have to ask, what’s the catch? This almost seems like it’s ready to fully replace Excel but I’ve seen many things die in that pursuit. What will Excel users miss by switching?

I’m don’t do a ton to big data stuff, but sometimes despite Excels stated row and column support- I find it effectively melts down if even 100K/100 of data and forget adding formulas.

jpiech [3 hidden]5 mins ago
Thanks. This is just a set of features that offer better (sometimes significantly better) performance and make it possible to work with large data sets in some predictable way, that is, knowing what the performance is with 100K formulas, you should be able to estimate how it'll work with 10 millions no matter how much formatting is applied, added hyperlinks, links to external closed workbooks, lists etc. That is probably correct, in the upcoming years there might be no way to compete directly with (the standalone) Excel. Hopefully it just helps to perform or verify various things way faster than in Excel with add-ins (or often at all).
dismalaf [3 hidden]5 mins ago
> What will Excel users miss by switching?

The ability to use legacy spreadsheets and macros.

Let's be real, Excel self perpetuates by at once being awful but also the thing everyone used to use thus must still use.

Lots of spreadsheet apps better than Excel have come and gone over the years...

ASalazarMX [3 hidden]5 mins ago
MS Office Macros are too easy for untrained users (that think of themselves as power users) to create, and once they grow big enough, it's very hard to port them to other applications. They're the monkey paw of spreadsheets.
tobwen [3 hidden]5 mins ago
VBA and the ability to use Office files with embedded VBA are disabled in many corporations… Some malware in the past used VBA for their attacks and Microsoft never added a proper sandbox.
akdor1154 [3 hidden]5 mins ago
Looks very cool, but Windows only?
jpiech [3 hidden]5 mins ago
Thanks. Yes, that was put off a bit too long, but there are plans to come up with the multiplatform version this year.
Yiling-J [3 hidden]5 mins ago
https://rowzero.io/ can handle 1 billion+ rows and offers native Python support. Also compatible with Excel and Google Sheets. However it’s a cloud based solution, and the private hosting option is only available to Enterprise users.