Skip to content

Latest commit

 

History

History
103 lines (80 loc) · 4.7 KB

sapientml.adoc

File metadata and controls

103 lines (80 loc) · 4.7 KB

SapientML Project Proposal

  • Name of project: SapientML

  • Requested project maturity level: Sandbox

  • Project description: Automatic machine learning, commonly referred to as AutoML, holds great promise in democratizing the utilization of machine learning (ML) by automating a large portion of the work typically performed by data scientists. However, the vast search space of potential pipelines poses a challenge, often resulting in suboptimal or no pipelines being generated, particularly when dealing with large and complex datasets. SapientML addresses this issue by leveraging a collection of pre-existing datasets and their human-created pipelines, enabling efficient generation of high-quality pipelines for new datasets with predictive tasks. With SapientML, data scientists can rapidly create and amend AI models, as the code is provided along with detailed explanations. Furthermore, citizen data scientists can easily create the desired AI models as well.

  • Statement on alignment with LF AI’s mission.: AutoML plays a vital role in democratizing AI and has the potential to extend the benefits of AI to a wider audience.

  • Have you identified possible collaboration opportunities with current LF AI hosted projects?

    • Pre-processing

      • Amundsen

      • Feast

      • Feathr

    • Modelling

      • Adversarial Robustness Toolbox

      • AI Explainability 360

      • AI Fairness 360

      • Intersectional Fairness

    • Deployment

      • Acumos

    • Coding

      • Elyta

      • Kedro

  • License: (name, version, and URL to license text): Apache License, Version 2.0

  • Source control:

  • Does the project sits in its own GH organization?: Yes

  • Do you have the GH DCO app active in the repos?: No

  • Issue tracker: https://github.com/sapientml/sapientml/issues

  • Collaboration tools: GH issues

  • External dependencies: (including licenses (name and version) of those dependencies.)

    • numpy, ^1.19.5, BSD 3-Clause "New" or "Revised" License

    • pandas, ^1.5.3, BSD 3-Clause "New" or "Revised" License

    • scikit-learn,1.1.3, BSD 3-Clause "New" or "Revised" License

    • scipy, ^1.8.1, BSD 3-Clause "New" or "Revised" License

    • tqdm, ^4.61.1, Mozilla Public Licence (MPL) v. 2.0

    • Jinja2, ^3.0.3, BSD 3-Clause "New" or "Revised" License

    • nbconvert, ^7.4.0, BSD 3-Clause "New" or "Revised" License

    • numba, ^0.56.4, BSD 2-Clause "Simplified" License

    • pydantic, ^1.10.6, MIT License

    • mecab-python3, ^1.0.6, BSD 3-Clause "New" or "Revised" License

    • fasttext-wheel, ^0.9.2, MIT License

    • python-dateutil, ^2.8.2, Apache License 2.0

    • xgboost, ^1.7.4, Apache License 2.0

    • matplotlib, ^3.7.1, Python-2.0(Python License 2.0)

    • matplotlib-inline, ^0.1.2, BSD 3-Clause "New" or "Revised" License

    • japanize-matplotlib, ^1.1.3, MIT License

    • seaborn, ^0.12.2, BSD 3-Clause "New" or "Revised" License

    • nltk, ^3.6.2 ,Apache License 2.0

    • optuna, ^2.8.0, MIT License

    • catboost, ^1.0.6, Apache License 2.0

    • astor, ^0.8.1, BSD 3-Clause "New" or "Revised" License

    • graphviz, ^0.20.1, MIT License

    • nbformat, ^5.1.3, BSD 3-Clause "New" or "Revised" License

    • toml, ^0.10.2, MIT License

    • lightgbm, ^3.3.5, MIT License

    • imbalanced-learn, ^0.10.1, MIT License

    • shap, ^0.41.0,MIT License

    • statsmodels, ^0.13.5, BSD 3-Clause "New" or "Revised" License

    • patsy, ^0.5.3, BSD 2-Clause "Simplified" License

    • category-encoders, ^2.5.1.post0, BSD 3-Clause "New" or "Revised" License

    • libcst,*, MIT License

    • astunparse, ^1.6.3,Python-2.0(Python License 2.0)

  • Initial committers:

  • Have the project defined the roles of contributor, committer, maintainer, etc.? : No (work in progress)

  • Total number of contributors to the project including their affiliations.:9

  • Does the project have a release methodology?: No (work in progress)

  • Does the project have a code of conduct?: No (work in progress)

  • Did the project achieve any of the CII best practices badges? : No

  • Do you have any specific infrastructure requests needed as part of hosting the project in the LF AI?: No

  • Project website - Do you have a web site?: No (Landing page will be provided)

  • Project governance - Do you have a working governance model for the project?: No

  • Social media accounts - Do you have any Twitter/LinkedIn/Facebook/etc. project accounts?: No

  • Existing sponsorship: Fujitsu