Tools Directory
45+ Data, AI, Dev and Cloud tools with official documentation.
45+ tools, 9 categories, 100% official docs, updated 2026
Python
Essential programming language for data and AI, with a rich ecosystem of scientific and ML libraries. The de facto standard in the field.
dbt
SQL-based data transformation tool to model, test and document data in a warehouse. A standard component of the modern data stack.
Apache Airflow
Data workflow orchestration platform. Enables scheduling, monitoring and managing complex data pipelines.
TensorFlow
Google's open-source framework for machine learning and deep learning. Used to train and deploy models at scale.
AWS
Amazon's cloud platform and the global market leader. Over 200 managed services covering compute, storage, data and AI.
Google Cloud Platform
Google's cloud platform with a strong data and AI focus. Includes BigQuery, Vertex AI and other widely used analytics services.
Snowflake
Cloud-native data warehouse separating compute and storage. Modern data stack standard for large-scale analytics.
Grafana
Open-source visualisation and monitoring platform. Create real-time dashboards for metrics and logs.
Streamlit
Python framework to build data and AI web apps without front-end skills. Ideal for rapid dashboard and AI demo prototyping.
Cursor
AI-powered IDE for coding with LLM assistance. Aims to speed up writing, refactoring and understanding code.
Claude (Anthropic)
Anthropic's LLM, recognised for reliability, safety and advanced analytical capabilities. Strong at reasoning and code.
OpenAI
Leading AI platform offering GPT-4, DALL-E and Whisper. Its API is a de facto industry standard for integrating LLM capabilities into applications.
Mistral AI
French AI company publishing performant open-weight LLMs. Sovereign alternative to American models; the open-weight models can be deployed on-premise.
Microsoft Azure
Microsoft's cloud platform integrated with the enterprise ecosystem. Azure OpenAI Service for secure LLM deployments.
Figma
Industry standard collaborative design tool. Used to design interfaces for data and AI products.
Notion
All-in-one workspace for documentation, project management and knowledge base. Widely used in data teams.
Miro
Collaborative whiteboard for architecture design, design thinking workshops and system mapping.
Docker
Containerisation platform ensuring application portability. De facto standard for deploying data and AI systems.
Apache Kafka
Distributed streaming platform to process very high-volume real-time data streams.
SQL
Standard query language for relational databases. Fundamental skill for any data profile.
Scikit-learn
Classic Python machine learning library. Reference for predictive modelling, classification and clustering.
LangChain
LLM orchestration framework to build composable AI applications: RAG, agents, complex processing chains.
Hugging Face
Central hub for open-source AI models. Access, fine-tune and deploy thousands of ML and LLM models.
Chroma
Open-source local-first vector database. Reference solution for rapidly prototyping RAG systems.
Prometheus
Open-source monitoring and alerting system. Standard for collecting and querying real-time performance metrics.
SAP
Leading ERP suite. Present in most large organisations, where it is a frequent data source for data projects.
Salesforce
World-leading CRM. Key data source for sales and marketing-oriented data and AI projects.
Canva
Accessible graphic design tool integrating AI features. Create visuals and presentations for data teams.
Runway
Generative AI video and image creation platform. Reference for multimedia visual content generation.
LangGraph
Library from the LangChain ecosystem to build AI agents as stateful graphs. Emerging standard for agentic orchestration.
LangSmith
Observability platform dedicated to LLM systems. Tracing, evaluation and debugging of AI applications in production.
LiteLLM
Unified proxy to call 100+ LLMs with a standardised API. Simplifies LLM routing and FinOps in production.
FastAPI
Modern Python framework to build high-performance APIs. Standard for exposing AI models and data services.
Kubernetes
Standard container orchestrator to deploy and scale applications in production. Essential for large-scale AI deployments.
PostgreSQL
Widely regarded as the most advanced open-source relational database. Supports embeddings through the pgvector extension. A standard choice for data applications.
Databricks
Unified lakehouse platform combining data engineering, ML and analytics. Built on Apache Spark.
Terraform
Infrastructure-as-code tool to provision and manage cloud resources declaratively and reproducibly.
MLflow
Open-source platform for ML lifecycle management: experiment tracking, model registry and deployment.
Pinecone
Managed cloud vector database. Production solution for large-scale RAG systems requiring high availability.
n8n
Open-source and self-hostable automation platform. Create complex workflows with or without code.
Make
Visual no-code automation platform to connect applications and automate processes.
Weights & Biases
MLOps platform to track ML experiments, visualise metrics and collaborate on AI models.
Gamma
AI-powered presentation and document creation tool. Generates professional slides from a simple prompt.