I’m currently a postdoc at University of Florida working with Dr. James Fairbanks and the AlgebraicJulia team.
I work with a team developing applied category theory software in Julia. The primary application here is building tools that make scientific computing and scientific knowledge representation easier, more transparent, and more robust to updating one’s model of the world. For an overview, see my talk Combinatorial Representations of Scientific Knowledge.
When we update our view of the world, it’s important to be able to migrate our old data, algorithms, and analysis tools into the new framework. When these tools are expressed in the language of C-sets, this migration can be automated in a verifiable way: Categorical data integration for computational science (furthermore, as stressed in the paper, these migration tools are of importance to scientists who wish to communicate and share data with lower risk of data misinterpretation).
Related application with Sean Wu: individual-based stochastic models for epidemiology
Knowledge representation and data integration
Scientists modeling complicated phenomena don’t use explicit (formally-specified) models. This is pragmatic, given current available open-source tools, but informal reasoning ulimately leads to serious challenges in communicating scientific results clearly and sharing data. A relational database backend is important for a scalable modeling tool, but a SQL-less interface is also crucial: the complexity of managing database implementation details quickly becomes unmanagable and unextensible as the model complexity increases. I develop a Python EDSL to help scientists generate relational databases from a natural declaration of scientific facts and to naturally query and publicly communicate their knowledge base.
Heterogenous data integration, using Category Theory There is little standardization in how data is to be represented and stored in many scientific fields. However, the varying schemas of different researchers contain significant overlap in information, and for data-driven fields it is especially beneficial to be able to freely switch from one frame of reference to another. Categorical Query Language, a tool developed by Conexus AI, allows for specification of data migration and integration in a declarative and clean process. I aim to show the utility of this approach in the field of computational chemistry, in collaboration with Ryan Wisnesky.
Machine learning applied to computational chemistry
A Chemical Analog to the Convolution Operation Along with Brian Rohr and Michael Statt, I develop models which quickly compute high-level properties of atomic systems from cheap descriptors by leveraging recent advances in ML (e.g. graph-convolutional and message-passing networks). The link goes to our poster which was awarded 1st place in Andrew Ng’s CS 230: Deep Learning course (Winter 2018).
Development of functionals for Density Functional Theory Simulation of chemical reactions using first-principles techniques requires a theoretical framework that is able to describe a wide range of electronic interactions. Under the direction of Johannes Voss, I develop new meta exchange-correlation functionals with a semi-empirical approach, fitting the functional form against higher level of theory and experimental benchmark data. By using Bayesian statistics, we enable uncertainty estimation of the computed reaction energies.
The website is very unfinished in terms of its content and the technology underneath. Some features:
Togglable button to turn each page into a flashcard.
Internal links (and lists of pages that link to current page)
Tags, represented by colors:
Definitions
Propositions
Exercises
Examples
Kris’ commentary
Content written in LaTeX: \(1+\frac{\phi}{sin(\psi)}\)
Foot1
Citations [1]
TikZ:
Images:
HTML thrown together via Haskell (source code) with help from Pandoc.
Can be opened and closed. A counter to the right says how many blocks of text are contained underneath.
And, starting
here…
Left | Center | Right |
---|---|---|
Aligned | Aligned | Aligned |
Text | Text | Text |