Research Focus: Week of October 7, 2024
Simplifying secure decision tree training; Improving accuracy of audio content detection; A novel neurosymbolic system for converting text to tables; New video series: AI for Business Transformation; TEE security protections for container workloads.
OmniParser V2
OmniParser is an advanced vision-based screen parsing module that converts user interface (UI) screenshots into structured elements, allowing agents to execute actions across various applications using visual data . By harnessing large vision-language model capabilities,…
MICON (Molecular-Image Contrastive Learning)
This is the repository for paper “Causal integration of chemical structures in self-supervised learning improves representations of microscopy images for morphological profiling”. Learning effective representations of cells in microscopy images can fuel many applications. Here,…