Analytics platforms, data pipelines, BI tools, and data science frameworks with AI.
100 tools
Databricks is a data insights solution for unified analytics platform. Key capabilities include collaborative notebooks for data science and engineering, real-time data processing with apache spark, and machine learning model management and deployment.
HelixDB combines graph and vector capabilities in a single database, enabling both relationship queries and semantic search. Built for teams building AI applications, recommendation engines, and knowledge graphs at scale.
Modern business intelligence platform built on AI-native architecture. Query data with natural language, build interactive dashboards, and share insights across teams.
Data security platform that monitors third-party GenAI tools for unauthorized data exposure. Scans LLM usage for compliance violations, audits data transfers, and blocks high-risk AI tool usage in real-time.
Data labeling and annotation for LLM training datasets. Provides crowdsourced and expert labeling, quality control, and integration with Hugging Face. Used by ML teams to prepare domain-specific training data for fine-tuning.
BIOS Health decodes neural signals from implantable and wearable devices, converting real-time nerve data into actionable clinical insights. It uses machine learning to interpret neural patterns, integrates with EHRs, and provides a dashboard for clinicians to monitor patient status and adjust treatment.
Neural network platform for brain research and clinical applications. Processes neuroimaging data (fMRI, EEG, MEG) with deep learning models to generate brain maps, predict treatment responses, and support clinical decision-making. Used by neuroscience labs, hospitals, and research institutions.
Basedash is a database client that generates BI dashboards from your existing SQL data without code. Query any PostgreSQL, MySQL, or MongoDB database to build custom reports, track metrics, and collaborate with non-technical stakeholders in minutes.
LanceDB is an open-source vector database engineered for similarity search at scale. It provides serverless compute, sub-millisecond latency, and native integration with Python ML frameworks. AI engineers use it to store embeddings, power semantic search, and serve RAG and recommendation pipelines without managing infrastructure.
Automorphic is an AI framework for training domain-specific language models with minimal data. Embed specialized knowledge into models without large labeled datasets. Includes real-time collaboration and performance analytics.
Great Expectations is an open-source data quality framework that validates data pipelines through testable expectations. Define checks as code, run them continuously against production data, and get documentation and data profiling reports. Used by data engineers to prevent bad data from reaching ML models and dashboards.
Sciloop is an AI co-scientist that automates machine learning experiment management and analysis. It tracks hyperparameters, metrics, and datasets across experiments, compares model performance side-by-side, and provides automated insights on what drives accuracy. Integrates with TensorFlow, PyTorch, and scikit-learn. Built for data scientists to eliminate manual tracking and speed up iteration.
dbt is a data insights solution for data transformation tool. Key capabilities include sql-based data transformation, version control for data models, and automated testing of data transformations.
Fivetran is a data insights solution for automated data integration. Key capabilities include automated data connectors, real-time data replication, and schema migration.
High-resolution spatial data platform for robotics and machine learning. Collects 3D environmental data, processes it in real-time, and provides customizable datasets via API. Powers perception models for autonomous systems and robotic applications.
Monte Carlo is a data observability platform that monitors data pipelines for quality issues, freshness delays, and schema changes. Detects anomalies in volume, distribution, and integrity, then alerts teams before broken data reaches downstream systems.
camelAI is an AI-powered business intelligence platform that analyzes data in real-time and generates actionable insights. It features customizable dashboards, predictive analytics, automated reporting, and natural language query processing. Teams use it to identify trends, make data-driven decisions, and automate report generation.
Unsiloed AI parses unstructured data (documents, images, audio, video) using multimodal AI APIs. Extract structured data from messy sources in real time. Integrates directly into your data pipeline with analytics and reporting.
AWS (Amazon Web Services) is a comprehensive cloud computing platform providing alternatives to traditional infrastructure through 200+ services. It offers scalable computing via EC2, managed databases through RDS, serverless functions with Lambda, and content delivery via CloudFront. Designed for developers, startups, and enterprises seeking flexible cloud infrastructure without managing physical servers.
Trainy manages GPU clusters for AI/ML workloads. It automates GPU resource allocation, provides real-time monitoring, and supports multi-cloud model deployment. The dashboard allows cluster management for AI/ML model training and workload orchestration.
Panels collects high-quality audio data for AI labs. It offers customizable datasets, real-time analysis, and a user-friendly interface for data management. API access integrates with existing workflows.
Nao Labs is an open-source analytics agent that collects and analyzes real-time data from multiple sources. It provides customizable dashboards for tracking customer behavior, marketing campaign performance, and sales metrics with flexible integration options.
Cinder automates data labeling and provides AI bias detection for machine learning models. Includes fairness testing, model evaluation tools, and collaboration workflows for data annotation.
Kater.ai is a business intelligence tool. It processes natural language for data queries and provides real-time data visualization. Features include customizable dashboards, collaboration tools, automated reporting, and alerts. It generates business reports, analyzes market trends, and monitors financial performance.
Data orchestration platform with software-defined assets. Declarative approach to building, testing, and monitoring data pipelines with built-in lineage.
Strand AI curates multimodal biological datasets (genomics, proteomics, imaging) optimized for AI model training. Researchers access integrated datasets, visualize complex biology, and collaborate on data annotation for drug discovery and biomarker studies.
Louiza Labs synthesizes clinical trial data, regulatory filings, and scientific literature to accelerate pharmaceutical research decisions. It provides market sizing, competitive intelligence, and risk assessment for drug candidates in weeks instead of months.
Aluna provides curated biomedical datasets and AI-driven analysis tools for healthcare researchers and ML engineers. It supports machine learning workflow integration, real-time data updates, custom reporting, and collaborative research project management.
Tableau is a business intelligence platform that converts raw data into interactive dashboards and reports. Sales, marketing, and operations teams use it to visualize metrics, identify trends, and monitor KPIs in real-time.
Sieve combines AI algorithms and human review for data cleaning. It offers API access, an Excel plugin, and real-time validation. Use it to clean large datasets, ensure accuracy, and integrate data from multiple sources.
Looker is a data insights solution for business intelligence platform. Key capabilities include data exploration and visualization, customizable dashboards, and real-time data analytics.
Maven Bio analyzes life sciences data using AI. It provides real-time insights, reporting, and customizable dashboards. The platform includes collaboration tools and integrates with LIMS. Use cases include drug discovery, clinical trial analysis, and biomarker identification.
We are disrupting the consulting industry with hirable AI analysts for strategy and corporate finance work, starting with Excel - where half the work happens.
Voker provides real-time performance tracking and customizable dashboards for AI agents. It generates automated reports and integrates with popular AI frameworks. The tool is designed for monitoring AI agent performance and identifying areas for improvement.
Monarcha is a spatial data platform combining AI analysis with geospatial visualization. It processes satellite imagery, climate data, and location-based datasets to generate predictive insights. Customizable dashboards and APIs enable automated reporting for urban planning, environmental monitoring, and logistics optimization.
Evidently AI monitors ML models in production, detecting data drift and model degradation before they impact performance. It visualizes metrics across ML frameworks and enforces fairness checks for regulatory compliance.
Ardis AI automates text analysis and extraction, converting unstructured data into searchable knowledge graphs. It integrates with existing data sources, provides dynamic visualization, and enables advanced natural language search. Teams use it to make text data accessible and queryable without manual processing.
Redbird provides AI-driven analytics. It offers real-time data visualization, predictive analytics via machine learning, customizable dashboards/reports, and collaboration tools. Integrates with various data sources. Use cases include sales forecasting, customer segmentation, and market trend analysis.
Scale AI provides data labeling, annotation, and curation services for machine learning teams. It automates data pipeline management, quality assurance, and validation at enterprise scale. Teams use it to prepare production-quality training datasets faster without manual annotation bottlenecks.
Novaflow analyzes biological experiment data with real-time insights and automated reporting. The platform integrates with lab instruments, provides data visualization dashboards, and enables team collaboration on research findings.
Zeit AI lets business users query datasets using plain English instead of SQL, automatically generating visualizations and reports. It powers real-time dashboards and KPI monitoring for data teams and executives who need insights without writing queries.
Lotas provides AI tools for data science and 3D modeling. It offers automated data analysis, 3D visualization, and collaboration features. Customizable dashboards and integration with data storage solutions are included. Use cases include creating 3D models from data and automating analysis workflows.
AskYourDatabase enables natural language queries across multiple database types, eliminating the need for SQL knowledge. It provides real-time data visualization and custom dashboards for analytics and reporting.
MindsDB is an AI platform that automates machine learning and integrates with multiple data sources for real-time predictions. It provides a unified interface for connecting disparate data sources, training models, and generating AI-driven insights without manual ML engineering.
Collaborative data annotation platform for machine learning teams. Provides version control, customizable workflows, and integration with popular ML frameworks. Designed for data scientists and research teams to label training data efficiently.
DeepGrove provides AI-driven real-time data analytics and insights. It offers customizable dashboards, collaboration tools, cross-device compatibility, and predictive analytics for analyzing complex datasets.
Instantly transform your data into actionable insights without coding. Dot analyzes complex datasets and generates clear, visual reports for business teams, enabling faster, data-driven decision-making across departments.
Lightly automates data labeling and collaborative annotation for ML teams. Integrates with popular ML frameworks and provides real-time data insights. Used by machine learning engineers and data scientists to improve dataset quality and accelerate model training.
Sarus enables analytics and machine learning on personal data while preserving privacy through anonymization and data governance. It includes real-time analytics dashboards, ML model deployment, and API integration for secure data sharing.
Chamber automates AI model deployment and production monitoring. It handles model scaling, performance tracking, and infrastructure management. The platform integrates with existing ML pipelines and provides a dashboard for observability.
Sureform collects, annotates, and validates human data for robotics and embodied AI systems. It provides real-time data validation and integrates with ML frameworks for training models on human motion and manipulation tasks.
Power BI is a data insights solution for microsoft business analytics. Key capabilities include interactive data visualization, real-time dashboard updates, and customizable reports and analytics.
Findly is an AI co-pilot for business intelligence. It translates natural language questions into queries, generates reports, and builds dashboards automatically from your data.
Aquarium Learning helps machine learning teams improve model performance by assessing and enhancing dataset quality. It offers tools for bias identification, automated augmentation, and version control, facilitating collaboration and reproducibility.
CellChorus combines microscopy imaging with machine learning to analyze single-cell performance and interactions. Used by biotech and pharmaceutical researchers to accelerate drug discovery and understand cellular behavior.
Hightouch is a data insights solution for data activation platform. Key capabilities include reverse etl capabilities, real-time data syncing, and customizable data transformations.
Ecliptor provides NLP and customizable embedding models for analyzing large unstructured datasets in real-time. It connects to popular data sources via API, enabling analysis, visualization, and model building on text-heavy data.
Encord provides data annotation tools for machine learning teams with collaboration features, dataset version control, and integration with ML frameworks. It manages data storage, supports model training workflows, and enables real-time data analysis.
Segment is a customer data platform that collects, unifies, and routes customer data from multiple touchpoints to analytics tools, marketing platforms, and data warehouses. It enables businesses to create a single source of truth for customer information while ensuring data consistency across their entire technology stack.
IOMETE is a self-hosted data lakehouse designed for the AI era, ensuring data ownership, privacy, and cost efficiency while providing flexible deployment options.
Provides advanced rerankers and embeddings for semantic search and retrieval across documents, websites, and databases. Delivers human-level accuracy and speed for developers, enterprises, and platforms needing precise information retrieval at scale.
PandasAI is an open-source data-integration platform. It processes natural language for data queries, integrates with data sources, and offers real-time data visualization. The tool provides customizable reporting and collaboration features.
Evidence is a powerful B2B SaaS tool that enables users to build, version control, and publish data products using SQL, markdown, and AI, enhancing the data analytics experience.
Chonkie is an open-source data ingestion tool for AI. It supports real-time data ingestion from multiple sources, offers customizable data transformation pipelines, and integrates with AI frameworks. A dashboard is included for monitoring.
Reworkd automates web data extraction for sales, marketing, and research teams. Extract prospect information, competitive intelligence, and market data at scale without technical expertise.
Data Driven Bioscience provides rapid cancer genomics profiling with DNA and RNA sequencing completed in 2 days. Reports integrate directly into EHRs, enabling clinicians to access genomic insights in their existing workflows. Used for diagnosis, treatment planning, and monitoring.
David AI provides high-quality audio datasets and real-time audio processing for training and deploying audio AI models. It enables building speech recognition systems, voice activity detection, and acoustic analysis.
Roamaround uses AI to map and visualize data with real-time collaborative dashboards. Project teams track progress, customize views, and share insights through interactive visualizations integrated with productivity tools.
HouseCanary provides AI-driven property valuation and market analysis across 130+ million properties. Uses 1,000+ data points per property to generate institutional-grade valuations and risk assessments for real estate investors.
Buster is an analytics engineering platform that automates data pipeline creation, monitors data quality, and generates custom dashboards. It enables data teams to build reliable analytics infrastructure and surface AI-driven insights from connected data sources.
Artificial Societies simulates entire populations and their interactions using AI, enabling organizations to test policies, forecast outcomes, and understand complex social dynamics. Used for urban planning, policy evaluation, economic forecasting, and research.
Secoda is a data enablement platform that helps modern data teams centralize, document, and govern their data assets while enabling self-service access for business users.
Ocular AI processes real-time data for LLMs and offers computer vision capabilities. It integrates with enterprise systems and provides customizable AI models. A user-friendly dashboard visualizes data.
Shaped is a real-time search and recommendation engine for feeds, searches, and AI agents. It indexes data from multiple sources with customizable ranking algorithms and fast query processing.
Mica replaces humans in fixing bad data by deploying AI agents that resolve non-happy path errors in data pipelines. It connects to your tech stack, uses business context to handle exceptions, and ensures scalable, auditable, and cost-effective data operations.
Logital AI (Teclada) compares large language models and AI systems with noise-reduction algorithms for unbiased evaluation. It visualizes model output in real-time, provides customizable reporting, and benchmarks models on accuracy, latency, and cost metrics.
VitalStrata is an analytics tool for accurate risk adjustment. It automates risk adjustment analysis, validates data in real-time, and provides predictive analytics for patient outcomes. It offers comprehensive reporting and integrates with EHRs. Use cases include ensuring accurate risk adjustment for value-based care, monitoring data integrity, and identifying compliance risks.
Dartboard Energy provides real-time and predictive analytics for electricity markets. It offers customizable dashboards, reporting, and automated alerts for market fluctuations. The tool integrates with energy trading platforms.
Mundo AI provides curated multilingual training data for building and improving AI models. It offers customizable datasets, real-time updates, quality assurance, and API access for easy integration into model training workflows.
Parsagon tracks and analyzes public policy using AI. It provides real-time tracking, customizable alerts for policy changes, and data visualization. Features include AI-driven analysis, collaboration tools, and policy summaries.
No-code data integration platform for cleaning, mapping, and importing data from multiple sources. AI helps identify data issues and suggests transformations. Used by analysts and business teams to reduce manual data prep work and improve data quality before analysis.
The Synthesis Company uses AI to accelerate scientific evidence synthesis and literature reviews. It helps researchers analyze and summarize medical and academic research 100x faster.
At Invert, we’re uniting cutting-edge technology and AI with the science of bioprocessing to accelerate therapies and sustainable bioproducts worldwide. Join us on a mission that matters.
PromptLoop automates B2B data collection from multiple sources. It offers customizable AI models for specific dataset needs, real-time data processing, and analysis. Manage datasets via a user-friendly interface and integrate with data visualization tools.
Iambiq Technologies automates text-based data processing. It extracts and analyzes text data from documents using Natural Language Processing. Features include customizable workflows, real-time analytics, and integration with existing data management systems. It processes large volumes of text data efficiently.
Datasaur manages data labeling workforces for NLP. It provides automated workflows, real-time collaboration, and quality assurance for labeled data. Features include customizable interfaces and integration with NLP frameworks.
Datafold automates data engineering workflows with AI-driven quality checks and anomaly detection. It integrates with data warehouses, visualizes data lineage for compliance, and includes collaboration tools to help data teams maintain data accuracy and trust.
Eventual is an AI data engine designed for processing data across any modality and scale. It enables building and managing customizable data pipelines, real-time analytics, and integrates with existing tools to handle large datasets efficiently.
Menza automates data analysis and reporting. It provides real-time insights from large datasets, customizable dashboards, and AI-driven predictive analytics. It also cleans and prepares data, detects anomalies, and facilitates team collaboration.
Honeydew is a semantic layer that creates a single source of truth for data across BI and AI platforms. It standardizes data definitions, enforces governance policies, and enables self-service analytics. Used by analytics teams to ensure data consistency and reduce query errors.
Narrator is a data preparation platform for data integration. It offers automated data preparation, natural language querying, data visualization, and team collaboration. It integrates with popular data sources to prepare data for analysis, generate real-time insights, and create custom data reports.
SID automates data retrieval using AI algorithms that connect to multiple data sources. It provides real-time analytics, customizable search parameters, and reporting tools designed for non-technical users analyzing large datasets.
NanoNets provides AI-powered automatic data extraction for businesses, enabling seamless processing of documents and real-time data management. Its customizable templates and cloud integrations streamline workflows and enhance data accessibility.
DataSuite manages datasets for AI trainers. It automates data cleaning and preprocessing, provides version control, and includes collaboration tools. It integrates with AI frameworks and offers real-time analytics. Use it for managing large datasets and training AI models.
HyperGlue applies natural language processing to analyze text data from multiple sources and extract business insights. It provides sentiment analysis, customizable reporting, and real-time visualization dashboards for data-driven decision making.
Klarity is an AI analytics platform that generates insights from business data and provides personalized recommendations. Teams use dashboards and automated reports to make faster, data-driven decisions.
QueryPie AI automates data analysis and business process workflows through AI agents. Data teams use it to generate real-time SQL queries, build custom dashboards, and automate repetitive analysis tasks while maintaining security and compliance controls.
Frekil integrates real-world clinical data from multiple sources into a queryable intelligence layer. Pharmaceutical companies and healthcare providers use it to generate real-world evidence, support regulatory submissions, and optimize clinical outcomes.
Pipekit automates data pipeline orchestration for CI/CD tools. It processes data in real-time, integrates with existing CI/CD tools, and scales for enterprise data needs. Monitor and manage data workflows via a dashboard.
PhantomBuster is a SaaS tool that automates data extraction and lead generation from various platforms, providing unique intent data that helps build effective sales pipelines. It enables users to find fresh leads, enrich their lists, and send personalized outreach messages using AI.