Show HN: Slipstream – A Python library for stateful stream processing
woile 5 mins ago
Looks interesting! Great work!
When using the cache with two topics, each with multiple partitions, how does the join operation work? What if the partitions don't have the same ID?
skadamat 5 mins ago
Interesting! How do you see this comparing with Bytewax? https://github.com/bytewax/bytewax
menziess 5 mins ago
The link leads to the Read the Docs page.
The library can be used to parallelize data processing.
`AsyncIterables` can be used as data sources, and any `Callable` can be used as a sink.
RocksDB is used to preserve state.
Checkpoints are used to detect stream downtime: when a dependency stream goes down, its dependent streams are paused until it has recovered and caught up.
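
To make the source/sink idea concrete, here is a minimal sketch using only the standard library's `asyncio`. It does not use Slipstream's own API; the names `numbers`, `pump`, and `main` are hypothetical and only illustrate how an async iterable can feed a plain callable, with two such pipelines running concurrently.

```python
import asyncio
from typing import Any, AsyncIterable, Callable


async def numbers(n: int) -> AsyncIterable[int]:
    """Hypothetical source: an async generator standing in for a topic."""
    for i in range(n):
        await asyncio.sleep(0)  # hand control back to the event loop
        yield i


async def pump(source: AsyncIterable[Any], sink: Callable[[Any], Any]) -> None:
    """Forward every item from an async iterable to a plain callable."""
    async for item in source:
        sink(item)


async def main() -> tuple[list[int], list[int]]:
    first: list[int] = []
    second: list[int] = []
    # Two independent pipelines share one event loop and run concurrently.
    await asyncio.gather(
        pump(numbers(3), first.append),
        pump((x async for x in numbers(3)), lambda x: second.append(x * 10)),
    )
    return first, second


if __name__ == "__main__":
    print(asyncio.run(main()))
```

Any callable works as the sink here (a list's `append`, a lambda, `print`), which is the property the comment above describes; persistence and checkpointing are deliberately left out of this sketch.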
menziess 5 mins ago
Code: https://github.com/Menziess/slipstream-async