56 open source Python projects




1. Flask



This is a microframework written in Python. It has no built-in form validation or database abstraction layer, but lets you use third-party libraries for such common functions, which is why it is called a microframework. Flask is designed to make building applications quick and easy, and it is also scalable and lightweight. It is based on the Werkzeug and Jinja2 projects. You can learn more about it in the latest DataFlair article on Python Flask.
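
To give a feel for how little code a Flask application needs, here is a minimal sketch. The route paths, function names, and response strings are made up for illustration; the example uses Flask's built-in test client so it runs without starting a server.

```python
from flask import Flask

app = Flask(__name__)


@app.route("/")
def index():
    # A view function returns the response body as a string.
    return "Hello from Flask!"


@app.route("/greet/<name>")
def greet(name):
    # <name> in the route is passed to the view as a keyword argument.
    return f"Hello, {name}!"


# The test client sends requests to the app in-process (no network needed).
with app.test_client() as client:
    response = client.get("/greet/Ada")
    print(response.status_code, response.get_data(as_text=True))
```

In a real project you would typically start the development server with the `flask run` command instead of using the test client.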



2. Keras



Keras is an open source neural network library written in Python. It is user-friendly, modular, and extensible, and can run on top of TensorFlow, Theano, PlaidML, or Microsoft Cognitive Toolkit (CNTK). Keras has it all: layers, objectives, activation functions, optimizers, and more. It also supports convolutional and recurrent neural networks.



Work on the latest Keras-based open source project - Breast Cancer Classification.







3. SpaCy



This is an open source software library for natural language processing (NLP), written in Python and Cython. While NLTK is better suited to teaching and research, spaCy's job is to provide software for production use. In addition, Thinc is spaCy's machine learning library, which provides CNN models for part-of-speech tagging, dependency parsing, and named entity recognition.



4. Sentry



Sentry offers hosted, open source error monitoring so you can detect and triage errors in real time. Just install the SDK for your language(s) or framework(s) and get started. It lets you capture unhandled exceptions, examine stack traces, analyze the impact of each issue, track errors across projects, assign issues, and much more. Using Sentry means fewer bugs and more code shipped.



5. OpenCV



OpenCV is an open source computer vision and machine learning library. It has over 2500 optimized algorithms for computer vision tasks such as detecting and recognizing objects, classifying human actions, tracking camera movements, building 3D models of objects, stitching images together into a high-resolution image, and many others. The library is available for many languages, such as Python, C++, and Java.



Stars on Github: 39585



Have you already worked on an OpenCV project? Here is one - Gender and Age Detection



6. Nilearn



This is a module for fast and easy statistical learning on neuroimaging data. It uses scikit-learn for multivariate statistics, with applications such as predictive modeling, classification, decoding, and connectivity analysis. Nilearn is part of the NiPy ecosystem, a community dedicated to using Python to analyze neuroimaging data.



Github Stars: 549



7. scikit-learn



Scikit-learn is another open source Python project. It is a very well-known machine learning library for Python. Often used together with NumPy and SciPy, scikit-learn offers classification, regression, and clustering, with support for SVMs (Support Vector Machines), random forests, gradient boosting, k-means, and DBSCAN. The library is written in Python and Cython.



Stars on Github: 37,144
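
As a small illustration of the clustering support mentioned above, the following sketch groups six hand-picked 2D points into two clusters with k-means. The data points and parameter values are invented for the example.

```python
from sklearn.cluster import KMeans

# Two visually obvious groups: three points near (1, 1), three near (8, 8).
points = [
    [1.0, 1.1], [0.9, 1.0], [1.2, 0.8],
    [8.0, 8.1], [7.9, 8.3], [8.2, 7.8],
]

# random_state makes the run reproducible; n_init is the number of restarts.
model = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = model.fit_predict(points)

# Each point gets a cluster id (0 or 1); which group gets which id is arbitrary.
print(labels)
print(model.cluster_centers_)
```

Most scikit-learn estimators follow the same pattern: construct the model with its parameters, then call `fit`, `predict`, or `fit_predict` on the data.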



8. PyTorch



PyTorch is another open source machine learning library written in and for Python. It is based on the Torch library and is great for areas such as computer vision and natural language processing (NLP). It also has a C++ frontend.



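Among many other features, PyTorch offers two high-level features: tensor computation with strong GPU acceleration, and deep neural networks built on a tape-based automatic differentiation system.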







Stars on Github: 31,779



9. Librosa



Librosa is one of the best Python libraries for analyzing music and audio. It provides the building blocks needed to extract information from music. The library is well documented and comes with several tutorials and examples that will make your task easier.



Github Stars: 3107



An open source Python project built with Librosa - speech emotion recognition.



10. Gensim



Gensim is a Python library for topic modeling, document indexing, and similarity retrieval with large corpora. It targets the NLP and information retrieval communities. Gensim is short for "generate similar": it originally generated a short list of the articles most similar to a given article. Gensim is clear, efficient, and scalable. It provides efficient and simple implementations of unsupervised semantic modeling from plain text.



Github Stars: 9,870



11. Django



Django is a high-level Python framework that encourages rapid development and follows the DRY (Don't Repeat Yourself) principle. It is a very powerful framework and one of the most widely used in Python. It is based on the MTV (Model-Template-View) pattern.



Stars on Github: 44,214



12. Face recognition



Face recognition is a popular project on GitHub. It lets you recognize and manipulate faces from Python or the command line, using what it describes as the world's simplest face recognition library. It is built on dlib's deep learning face recognition, which reaches an accuracy of 99.38% on the Labeled Faces in the Wild benchmark.



Stars on Github: 28,267



13. Cookiecutter



Cookiecutter is a command line utility that you can use to create projects from templates (cookiecutters). One example would be creating a package project from a package project template. The templates are cross-platform, and project templates can be in any language or markup format, such as Python, JavaScript, HTML, Ruby, CoffeeScript, RST, and Markdown. It also lets you use multiple languages in the same project template.



Stars on Github: 10,291



14. Pandas



Pandas is a Python data analysis and manipulation library that provides labeled data structures and statistical functions.



Stars on Github: 21,404
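
The sketch below shows what labeled data structures and statistical functions look like in practice. The column names, index labels, and values are invented for illustration.

```python
import pandas as pd

# A DataFrame is a table whose rows and columns both carry labels.
df = pd.DataFrame(
    {
        "city": ["Oslo", "Oslo", "Lima", "Lima"],
        "temp_c": [4.0, 6.0, 19.0, 21.0],
    },
    index=["mon", "tue", "mon", "tue"],
)

# Select rows by label rather than by position.
tuesday = df.loc["tue"]

# Split the rows by city and compute the mean temperature of each group.
mean_by_city = df.groupby("city")["temp_c"].mean()

print(tuesday)
print(mean_by_city)
```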



Python open source project to try Pandas - Parkinson's disease detection



15. Pipenv



Pipenv promises to be a production-ready tool that aims to bring the best of all packaging worlds to the Python world. It has nicely colored terminal output and combines Pipfile, pip, and virtualenv in a single command. It automatically creates and manages a virtual environment for your projects and gives users an easy way to set up a working environment.



Github Stars: 18,322



16. SimpleCoin



This is a blockchain implementation for a cryptocurrency, built in Python; it is simple, insecure, and incomplete. SimpleCoin is not intended for production use: it is meant for educational purposes only, to make a working blockchain accessible and simple. It lets you store mined hashes and exchange them for any supported currency.

Github Stars: 1343
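
To illustrate the general idea behind an educational blockchain, here is a minimal sketch of a hash-linked chain with a toy proof of work. This is not SimpleCoin's actual code: the function names, block layout, and difficulty rule are assumptions made for the example, and only the Python standard library is used.

```python
import hashlib
import json


def block_hash(index, previous_hash, data, nonce):
    # Hash the block contents together with the hash of the previous block.
    payload = json.dumps([index, previous_hash, data, nonce]).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def mine_block(index, previous_hash, data, difficulty=2):
    # Toy proof of work: try nonces until the hash starts with enough zeros.
    prefix = "0" * difficulty
    nonce = 0
    while True:
        digest = block_hash(index, previous_hash, data, nonce)
        if digest.startswith(prefix):
            return {
                "index": index,
                "previous_hash": previous_hash,
                "data": data,
                "nonce": nonce,
                "hash": digest,
            }
        nonce += 1


def is_valid(chain, difficulty=2):
    # A chain is valid if every stored hash matches the block contents,
    # satisfies the difficulty rule, and links to the block before it.
    for i, block in enumerate(chain):
        expected = block_hash(
            block["index"], block["previous_hash"], block["data"], block["nonce"]
        )
        if block["hash"] != expected or not expected.startswith("0" * difficulty):
            return False
        if i > 0 and block["previous_hash"] != chain[i - 1]["hash"]:
            return False
    return True


chain = [mine_block(0, "0" * 64, "genesis")]
chain.append(mine_block(1, chain[-1]["hash"], "alice pays bob 5"))
print(is_valid(chain))  # the untouched chain passes validation

chain[1]["data"] = "alice pays bob 500"  # tamper with a stored block
print(is_valid(chain))  # the stored hash no longer matches the contents
```

Because each block's hash covers the previous block's hash, changing any block invalidates it and every block after it, which is what makes tampering detectable.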



17. Pyray



This is a 3D rendering library written in vanilla Python. It renders 2D, 3D, and higher-dimensional objects and scenes, as well as animations, in Python. It finds use in creating videos, video games, physical simulations, and even pretty pictures. Its requirements are PIL, numpy, and scipy.



Stars on Github: 451



18. MicroPython



MicroPython is Python for microcontrollers. It is an efficient Python 3 implementation that includes a subset of the Python standard library and is optimized to run on microcontrollers and in constrained environments. The pyboard is a small electronic circuit board that runs MicroPython on the bare metal, so it can be used to control all kinds of electronic projects.



Stars on Github: 9,197



19. Kivy



Kivy is a Python library for developing mobile apps and other multi-touch applications with a natural user interface (NUI). It has a graphics library, a range of widgets, the intermediate Kv language for creating your own widgets, and support for mouse, keyboard, TUIO, and multi-touch input events. It is an open source library for rapid development of applications with innovative user interfaces. It is cross-platform, business-friendly, and GPU-accelerated.



Stars on Github: 9,930



20. Dash



Dash by Plotly is a web application framework. Built on top of Flask, Plotly.js, and React.js, it lets us use Python to build dashboards. It makes it possible to use Python and R models at scale. Dash lets you build, test, deploy, and report without DevOps, JavaScript, CSS, or CronJobs. Dash is powerful, customizable, lightweight, and easy to manage. It is also open source.



Github Stars: 9,883



21. Magenta



Magenta is an open source research project that explores machine learning as a tool in the creative process. It lets you create music and art using machine learning. Magenta is a TensorFlow-based Python library with utilities for working with source data, using it to train machine learning models, and generating new content.



22. Mask R-CNN



This is an implementation of Mask R-CNN on Python 3, TensorFlow, and Keras. For each instance of an object in an image, the model generates bounding boxes and segmentation masks. It uses a Feature Pyramid Network (FPN) and a ResNet101 backbone. The code is easy to extend. This project also offers the Matterport3D dataset of reconstructed 3D spaces captured by customers ...

Stars on Github: 14,055



23. TensorFlow Models



This is a repository of various models implemented in TensorFlow, both official and research models. It also has samples and tutorials. Official models use the high-level TensorFlow APIs. Research models are models implemented in TensorFlow by researchers, who maintain them and handle support, questions, and pull requests themselves.



Stars on Github: 57,745



24. Snallygaster



Snallygaster is a tool that scans for secret files on HTTP servers: it looks for files that are accessible on web servers but should not be public and could pose a security risk.



Github Stars: 1,477



25. Statsmodels



This is a Python package that complements scipy for statistical computing, including descriptive statistics as well as estimation and inference for statistical models. It provides classes and functions for this purpose. It also lets us run statistical tests and explore data statistically.

Stars on Github: 4,246



26. WhatWaf



This is an advanced firewall detection tool that we can use to determine whether a web application firewall is present. It detects the firewall in front of a web application and tries to find one or more bypasses for it on the specified target.



Github Stars: 1,300



27. Chainer



Chainer is a deep learning framework focused on flexibility. It is based on Python and offers automatic differentiation APIs based on the define-by-run approach. Chainer also offers high-level object-oriented APIs for building and training neural networks. It is a powerful, flexible, and intuitive framework for neural networks.

Stars on Github: 5,054



28. Rebound



Rebound is a command line tool. When you get a compiler error message, it instantly fetches matching results from Stack Overflow. To use it, run your file with the rebound command. This is one of the 50 most popular Python open source projects of 2018. It requires Python 3.0 or higher. Supported file types: Python, Node.js, Ruby, Golang, and Java.



Github Stars: 2,913



29. Detectron



Detectron performs state-of-the-art object detection (it also implements Mask R-CNN). It is Facebook AI Research (FAIR) software written in Python and powered by the Caffe2 deep learning platform. Detectron's goal is to provide a high-quality, high-performance codebase for object detection research. It is flexible and implements the following algorithms - Mask R-CNN, RetinaNet, Faster R-CNN, RPN, Fast R-CNN, R-FCN.



Stars on Github: 21,873



30. Python-fire



This is a library for automatically generating CLIs (command line interfaces) from (any) Python object. It also helps you develop and debug code, explore existing code, or turn someone else's code into a CLI. Python Fire eases the transition between Bash and Python and makes using a REPL easier.

Stars on Github: 15,299



31. Pylearn2



Pylearn2 is a machine learning library built primarily on top of Theano. Its purpose is to make machine learning research easier. It lets you write new algorithms and models.

Stars on Github: 2,681



32. Matplotlib



Matplotlib is a 2D plotting library for Python that produces publication-quality figures in a variety of formats.



Stars on Github: 10,072



33. Theano



Theano is a library for manipulating mathematical and matrix expressions. It is also an optimizing compiler. Theano uses NumPy-like syntax to express computations and compiles them to run on CPU or GPU architectures. It is an open source machine learning library written in Python and CUDA that runs on Linux, macOS, and Windows.



Stars on Github: 8,922



34. Multidiff



Multidiff is designed to make machine-oriented data easier to understand. It helps you see the differences across a large number of objects by computing diffs between related objects and then displaying them. This visualization lets us look for patterns in proprietary protocols or unusual file formats. It is mainly used for reverse engineering and binary data analysis.



Stars on Github: 262



35. Som-tsp



This project focuses on using self-organizing maps (SOMs) to solve the traveling salesman problem (TSP). Using a SOM, it finds sub-optimal solutions to the TSP, reading problem instances in the .tsp format. TSP is an NP-complete problem, and it becomes increasingly difficult to solve as the number of cities grows.



Github Stars: 950
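
The following is a rough sketch of the self-organizing map idea applied to the traveling salesman problem, using only the standard library. It is not the project's own code: the function names, the parameter values, the decay schedule, and the assumption that city coordinates lie in the unit square are all choices made for this example. A ring of neurons is repeatedly pulled toward randomly chosen cities, and the final order of the neurons around the ring gives the order in which to visit the cities.

```python
import math
import random


def som_tsp(cities, iterations=2000, learning_rate=0.8, seed=0):
    """Return a visiting order (list of city indices) found by a ring-shaped SOM."""
    rng = random.Random(seed)
    n_neurons = len(cities) * 8
    neurons = [[rng.random(), rng.random()] for _ in range(n_neurons)]
    radius = n_neurons / 10

    def closest_neuron(city):
        cx, cy = city
        return min(
            range(n_neurons),
            key=lambda i: (neurons[i][0] - cx) ** 2 + (neurons[i][1] - cy) ** 2,
        )

    for _ in range(iterations):
        city = rng.choice(cities)
        winner = closest_neuron(city)
        for i, neuron in enumerate(neurons):
            # Distance along the ring, wrapping around at the ends.
            ring_dist = abs(i - winner)
            ring_dist = min(ring_dist, n_neurons - ring_dist)
            # Neurons near the winner on the ring move more than distant ones.
            influence = math.exp(-(ring_dist ** 2) / (2 * radius ** 2))
            neuron[0] += learning_rate * influence * (city[0] - neuron[0])
            neuron[1] += learning_rate * influence * (city[1] - neuron[1])
        # Shrink the step size and the neighborhood over time.
        learning_rate *= 0.9997
        radius = max(radius * 0.997, 1.0)

    # Visit cities in the order of their closest neuron around the ring.
    return sorted(range(len(cities)), key=lambda c: closest_neuron(cities[c]))


def tour_length(cities, route):
    # Total length of the closed tour, returning to the starting city.
    return sum(
        math.dist(cities[route[i]], cities[route[(i + 1) % len(route)]])
        for i in range(len(route))
    )


cities = [(0.1, 0.1), (0.9, 0.1), (0.9, 0.9), (0.1, 0.9),
          (0.5, 0.05), (0.95, 0.5), (0.5, 0.95), (0.05, 0.5)]
route = som_tsp(cities)
print(route, round(tour_length(cities, route), 3))
```

The result is a valid tour but is not guaranteed to be the shortest one, which matches the sub-optimal nature of the approach.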



36. Photon



Photon is an exceptionally fast web crawler designed for OSINT. It can extract URLs, URLs with parameters, intel, files, secret keys, JavaScript files, regular expression matches, and subdomains. The extracted information can then be saved and exported in JSON format. Photon is flexible and ingenious. You can also add plugins to it.



Github Stars: 5714



37. Social Mapper



Social Mapper is a social media mapping tool that correlates profiles using face recognition. It does this across various websites on a large scale. Social Mapper automates the search for names and photos on social networks, and then tries to accurately detect and group a person's presence. It then creates a report for human review. This is useful in the security industry (e.g. phishing). It supports LinkedIn, Facebook, Twitter, Google Plus, Instagram, VKontakte, Weibo, and Douban.



Stars on Github: 2,396



38. Camelot



Camelot is a Python library that helps you extract tables from PDF files. It works with text-based PDFs, but not with scanned documents. Each extracted table is a pandas DataFrame. You can also export tables to .json, .xls, .html, or .sqlite.



Github Stars: 2415



39. Lector



This is a Qt-based e-book reader. It supports the .pdf, .epub, .djvu, .fb2, .mobi, .azw / .azw3 / .azw4, .cbr / .cbz, and .md file formats. Lector has a main window, a table view, a book reading view, a distraction-free view, annotation support, a comic view, and a settings window. It also supports bookmarks, profile-based viewing, a metadata editor, and a built-in dictionary.



Github Stars: 835



40. m00dbot



This is a Telegram bot for self-testing depression and anxiety.



Github Stars: 145



41. Manim



This is an animation engine for explanatory math videos that you can use to create precise animations programmatically, using Python.



Stars on Github: 13,491



42. Douyin-Bot



A bot written in Python for a Tinder-like application. Developers from China.



Github Stars: 5,959



43. XSStrike



This is a cross-site scripting (XSS) detection suite with four hand-written parsers. It is also equipped with an intelligent payload generator, a powerful fuzzing engine, and an incredibly fast crawler. Instead of injecting payloads and checking whether they work, as other tools do, XSStrike analyzes the response with multiple parsers and then crafts payloads that are guaranteed to work, using context analysis integrated with the fuzzing engine.



Github Stars: 7050



44. Python Robotics



This project is a collection of Python code for robotics algorithms, including autonomous navigation algorithms.



Stars on Github: 6,746



45. Google Images Download



Google Images Download is a command-line Python program that searches Google Images for keywords and downloads the images for you. It is a small program with no dependencies if you only need to download up to 100 images per keyword.



Github Stars: 5749



46. Trape



Trape lets you track people and carry out intelligent social engineering attacks in real time. It helps show how large Internet companies can obtain confidential information and control users without their knowledge. Trape can also help track down cybercriminals.



Stars on Github: 4256



47. Xonsh



Xonsh is a cross-platform, Unix-gazing, Python-powered shell language and command prompt. The language is a superset of Python 3.5+ with additional shell primitives like those in Bash and IPython. Xonsh runs on Linux, Mac OS X, Windows, and other major systems.



Stars on Github: 3426



48. GIF for CLI



It takes a GIF, a short video, or a search query for the Tenor GIF API and converts it to animated ASCII art. It uses ANSI escape sequences for animation and color.



Github Stars: 2,547
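
To show how ANSI escape sequences can produce color and animation in a terminal, here is a minimal standard-library sketch. It is not the project's code: the function names and the two text frames are invented for the example, and it assumes a terminal that understands ANSI sequences.

```python
import sys
import time

RESET = "\x1b[0m"  # resets all text attributes


def colorize(text, color_code):
    # "ESC[38;5;<n>m" selects foreground color n from the 256-color palette.
    return f"\x1b[38;5;{color_code}m{text}{RESET}"


def play(frames, delay=0.1, out=sys.stdout):
    # Draw each frame, then move the cursor back up so that the next frame
    # overwrites the previous one ("ESC[<n>A" moves the cursor up n lines).
    previous_height = 0
    for frame in frames:
        if previous_height:
            out.write(f"\x1b[{previous_height}A")
        out.write(frame + "\n")
        out.flush()
        previous_height = frame.count("\n") + 1
        time.sleep(delay)


frames = [colorize("(o_o)", 46), colorize("(-_-)", 46), colorize("(o_o)", 46)]
play(frames, delay=0.2)
```

A real converter would first map each pixel of each GIF frame to a character and a color; the playback loop then works the same way.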



49. Cartoonify



Draw This is a polaroid camera that draws cartoons. It uses a neural network for object recognition, the Google Quickdraw dataset, a thermal printer, and a Raspberry Pi. Quick, Draw! is a Google game in which players are asked to draw a picture of an object or idea, and a neural network then tries to guess what it represents in less than 20 seconds.



Github Stars: 1,760



50. Zulip



Zulip is a group chat application that is both real-time and productive, thanks to threaded conversations. Many Fortune 500 companies and open source projects use it for real-time chat that can handle thousands of messages per day.



Stars on Github: 10,432



51. YouTube-dl



This is a command line program that can download videos from YouTube and some other sites. It is not tied to a specific platform.



Github Stars: 55,868



52. Ansible



This is a simple IT automation system that handles configuration management, application deployment, cloud provisioning, ad-hoc task execution, network automation, and multi-node orchestration.



Stars on Github: 39,443



53. HTTPie



HTTPie is a command line HTTP client. It makes CLI interaction with web services simple. Its http command lets us send arbitrary HTTP requests with a simple syntax and get colorized output. We can use it for testing, debugging, and interacting with HTTP servers.



Stars on Github: 43,199



54. Tornado Web Server



This is a web framework and asynchronous networking library for Python. It uses non-blocking network I/O to scale to tens of thousands of open connections. This makes it a good choice for long polling and WebSockets.



Github Stars: 18,306



55. Requests



Requests is a library that makes it easy to send HTTP/1.1 requests. You do not need to manually add query parameters to URLs or form-encode PUT and POST data.

Stars on Github: 40,294
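
The sketch below shows the library building a query string and a form-encoded body for you. The example.com URLs and the parameter names are placeholders; the requests are only prepared, not sent, so the example runs without network access.

```python
import requests

# Query parameters are passed as a dict and encoded into the URL.
search = requests.Request(
    "GET",
    "https://example.com/search",
    params={"q": "open source", "page": 2},
).prepare()
print(search.url)

# Form data is passed as a dict and encoded into the request body.
form = requests.Request(
    "POST",
    "https://example.com/submit",
    data={"name": "Ada Lovelace"},
).prepare()
print(form.body)
print(form.headers["Content-Type"])
```

In everyday use you would call requests.get(url, params=...) or requests.post(url, data=...) directly, which prepares and sends the request in one step.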



56. Scrapy



Scrapy is a fast, high-level web crawling framework - you can use it to crawl websites and extract structured data. You can also use it for data mining, monitoring, and automated testing.



Stars on Github: 34,493


