Introducing: OpenAleph and the Data and Research Center – DARC

:tada: :tada: :tada:
Hey all!

Some of you might already have seen it, but we’d like to take the opportunity here to announce some great updates regarding Aleph and its lovely open source community:

My organization investigativedata.io that most of you already know for offering managed Aleph instances has rebranded to the “Data and Research Center”, joined by 2 new co-founders, namely Karina Shedrofsky and Jan Strozyk. You know them as formerly leading the Data & Research Team at OCCRP. Read more about DARC: https://dataresearchcenter.org

Another great person you already know joined us: @alex (Alex Ștefănescu). Together with us they will work on OpenAleph, that we just launched as well. The purpose of this soft fork is to develop a new set of features while remaining compatible with Aleph’s data processing methodology (mainly FollowTheMoney).

After many discussions with OCCRP over the last year, we mutually agreed that forking the Aleph repository was the best path forward, as we plan to give OpenAleph its own direction.

This allows us now to move forward quickly with new features we are already working on, such as audio and video transription or improving the cross-reference. More to upcoming features and our general vision with OpenAleph will follow! You can read more about the launch of OpenAleph in this blog post.

Feel free to ask any questions in this thread. We appreciate that this might lead to confusion or more questions, so we are here and waiting for all your questions. No need to say that with starting OpenAleph we are standing on the shoulders of giants, and we are thankful for the insane work OCCRP and the open source community did on Aleph over the past years so far.

Let’s move forward together!

Cheers, Simon & DARC

3 Likes

Will it have search result highlights?

2 Likes

I will actually answer quite serious:

  1. Our Aleph deployments already run a quick fix that re-enables highlights for pdf documents. This makes the index a bit bigger, but we are solving this issue with our different deployment concept as storage is not a budget problem here.

  2. We are working on making re-indexing of collections (or a complete instance) much more faster, which will allow more fundamental changes to the elasticsearch mappings as reindexing doesn’t hurt anymore. This will be the more sustainable fix for the highlight issue.

1 Like