AutoML

AutoML

An MCP-based autoML platform for data analysis, preprocessing, modeling, and tuning.

9
Stars
2
Forks
0
Releases

Overview

AutoML is an MCP-powered platform that orchestrates end-to-end machine learning workflows, including data analysis, preprocessing, feature engineering, model selection, evaluation, and hyperparameter tuning. It offers data exploration capabilities such as computing dataset statistics (shape, memory usage, data types, missing values), efficient CSV loading with pandas and pyarrow, correlation analysis, and outlier detection. The preprocessing stage automates handling missing values, encoding categorical variables, and scaling numerical features, followed by feature engineering to prepare data for both regression and classification tasks. The platform supports a wide range of algorithms for regression (Linear Regression, Ridge, Lasso, ElasticNet, Random Forest, XGBoost, SVR, KNN, CatBoost) and classification (Logistic Regression, Ridge Classifier, Random Forest, XGBoost, SVM, KNN, Decision Tree, Naive Bayes, CatBoost). Evaluation includes regression metrics (R², MAE, MSE) and classification metrics (Accuracy, F1-Score), with confusion matrix visualization and model comparison. Hyperparameter tuning is automated with advanced search strategies, customizable scoring, and trial management. The project is organized into data samples, a tools module, utilities, and a server entry point (server.py) to run the MCP server, designed for integration with the MCP framework as described in the MCP docs.

Details

Owner
emircansoftware
Language
Python
License
MIT License
Updated
2025-12-07

Features

Data Information & Exploration

Get comprehensive dataset statistics including shape, memory usage, data types, and missing values.

CSV Reading

Efficient CSV file reading with pandas and pyarrow support.

Correlation Analysis

Visualize correlation matrices for numerical and categorical variables.

Outlier Detection

Identify and visualize outliers in datasets.

Automated Preprocessing

Handle missing values, encode categorical variables, and scale numerical features.

Feature Engineering

Prepare features for both regression and classification problems.

Model Training & Evaluation

Support for multiple ML algorithms for regression and classification and evaluation with performance metrics.

Hyperparameter Tuning

Automated hyperparameter optimization with advanced search, customizable scoring, and trial management.

Audience

Data ScientistBuild and evaluate ML models, perform automated preprocessing, and tune hyperparameters using MCP tools.
ML EngineerIntegrate MCP-based workflows into production data analysis pipelines, compare models, and visualize results.
AnalystExplore datasets, validate data quality, and gain insights with automated ML workflows.

Tags

MCPAutoMLData AnalysisPreprocessingFeature EngineeringModel SelectionHyperparameter TuningVisualizationRegressionClassificationXGBoostCatBoostscikit-learnPythonPandasCSV