I have 118 projects on GitHub
Papers and reading material on streaming systems
Port of LevelDB to Java
Kubernetes Handbook (Chinese guide) / Cloud Native Application Architecture Practice Handbook - https://jimmysong.io/kubernetes-handbook
Mirror of Apache Zeppelin
Wormhole is a SPaaS (Stream Processing as a Service) Platform
A large annotated semantic parsing corpus for developing natural language interfaces.
Vespa - the open big data serving engine
Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover, seamlessly and without downtime.
Computation using data flow graphs for scalable machine learning
Tabix.io UI
An edge-native container management system for edge computing
suning-ios: Suning.com iOS source code for learning
Build Spark Batch/Streaming/MLlib Application by SQL
Stream summarizer and cardinality estimator.
A fast JDBC connection pool, based on Stormpot: http://chrisvest.github.io/stormpot/
A Trident state implementation using the MySQL JDBC driver
An HBase connector for Storm
Storm Elastic Search Bolt
A collection of spouts, bolts, serializers, DSLs, and other goodies to use with Storm
Spark 2.0 Python Machine Learning examples
Analysis of Spark ML algorithm principles and their source code implementations
The sixth Hangzhou Spark & Flink Meetup
Apache Spark - A unified analytics engine for large-scale data processing
Serverless Google Cloud Functions Plugin – Adds Google Cloud Functions support to the Serverless Framework
⚡ Serverless Framework – Build web, mobile and IoT applications with serverless architectures using AWS Lambda, Azure Functions, Google CloudFunctions & more!
Alibaba's MQ, also aliyun ONS.
Example code from the book
React@16, react-router@4, redux and webpack@4 starter project
Node.js, Golang, Machine Learning, PostgreSQL, Deep Learning
Elastic data processing with Apache Pulsar and Apache Flink
The official home of the Presto distributed SQL query engine for big data
Pentaho Data Integration ( ETL ) a.k.a Kettle
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
A scalable, distributed Time Series Database.
A delightful community-driven (with 1,000+ contributors) framework for managing your zsh configuration. Includes 200+ optional plugins (rails, git, OSX, hub, capistrano, brew, ant, php, python, etc), over 140 themes to spice up your morning, and an auto-update tool that makes it easy to keep up with the latest updates from the community.
Newegg Nair client SDK for Node.js
Jekyll Themes / GitHub Pages blog template / A template repository for Jekyll-based blogs
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
Open source platform for the complete machine learning lifecycle
Metl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
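Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
A recommended way to learn C++, suited to those who already have some background. First read Effective C++, which can be finished in a day (read the second edition if you are coming from C, the third edition if you are coming from Java or similar), then read the Google C++ Style Guide. After that, read the LevelDB code (http://t.cn/aYyqjo, thanks to @apc2 for the recommendation), written by Sanjay and Jeff. It is short, complete, and very elegant, and it perfectly illustrates the principles laid out in the first two.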
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
Scalable Java Redis client
Advanced Redis client for thread-safe sync, async, and reactive usage. Supports Cluster, Sentinel, Pipelining, and codecs.
Jekyll theme.
LeetCode questions with Java implementations
Learning Spark ML
A LambdaMART implementation based on Spark
Java client for Kubernetes & OpenShift
Kubernetes Native Edge Computing Framework (project under CNCF)
Enterprise Stream Process Engine
"Java Learning + Interview Guide": covers the core knowledge that most Java programmers need to master.
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Wombat Color Theme for iTerm2
Apache Doris (Incubating) is an MPP-based interactive SQL data warehouse for reporting and analysis.
Apache Iceberg
Upserts, Deletes And Incremental Processing on Big Data.
HMFAYSAL OMEGA is a minimalist, beautiful, responsive theme for Jekyll designed for writers who want their content to take front and center.
Data.Blog
Heron is a realtime, distributed, fault-tolerant stream processing engine from Twitter
hdfs test
PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, and complex querying scenarios.
Welcome to the Jekyll theme Freshman21.
Apache Flink Stateful Functions
A CEP library for Flink to run Siddhi within an Apache Flink streaming application
Apache Flink
A microframework based on Werkzeug, Jinja2 and good intentions
A Python interface for Facebook fastText
Read-only mirror of https://gerrit.hyperledger.org/r/#/admin/projects/fabric
Chinese guide to the ELK Stack
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Elasticsearch real-time search and analytics natively integrated with Hadoop
Drools Expert is the rule engine and Drools Fusion does complex event processing (CEP).
Performance monitoring and tuning tool for Apache Hadoop
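Design pattern source code. This project is kept in sync with my CSDN blog. I hope to present my understanding of design patterns so that we can discuss and analyze them together. Alongside the explanations of the design patterns, I share the examples written while blogging. The code examples in this project are updated some time after each blog update. Please follow and support! Blog: http://blog.csdn.net/wangyang1354/
Chinese translation of Designing Data-Intensive Applications (DDIA)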
The current backend of DBToaster, implemented in Scala.
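DataX is an offline data synchronization tool/platform widely used within Alibaba Group. It provides efficient data synchronization between heterogeneous data sources including MySQL, Oracle, HDFS, Hive, OceanBase, HBase, OTS, and ODPS.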
Dark-Tech : A New Hexo Theme
Essential fundamentals for technical interviews: LeetCode, operating systems, computer networks, system design, Java, Python, C++
ClickHouse® is a free analytics DBMS for big data
Big Data ETL and Utilities for Hadoop Map Reduce and Storm
Central Application Tracking
Mirror of Apache CarbonData
Caravel is a data exploration platform designed to be visual, intuitive, and interactive
Apache Calcite
Sohu TV's Redis private cloud platform
KaiserY's Blog
Introduces blockchain-related technologies, from theory to practice, with Bitcoin, Ethereum, and Hyperledger.
Samples demonstrating the use of Blockchain with IBM Watson IoT
Azkaban workflow manager.
😎 A curated list of amazingly awesome Flink and Flink ecosystem resources
JavaScript & NodeJS Snippets for Atom Editor
Technology map for backend architects
Chinese translation project for the official Apache Flink documentation
An admin dashboard application demo built upon Ant Design and Dva.js
Mirror of Apache Ambari
Alluxio, data orchestration for analytics and machine learning in the cloud
Ace (Ajax.org Cloud9 Editor)