Bug fixes for step type, type checking, and PDF-to-zarr conversion; pandas deprecation fixes; some extra testing.
Getting-started pages in documentation and type annotations fixed
Test run that appears to miss many tables due to assumptions about page middle, columns, rotation correction, etc.
First version which trains and predicts correctly using a truncated charset