知識圖譜(Knowledge Graph),在圖書情報界稱為知識域視覺化或知識領域對映地圖,是顯示知識發展程序與結構關係的一系列各種不同的圖形,用視覺化技術描述知識資源及其載體,挖掘、分析、構建、繪製和顯示知識及它們之間的相互聯絡。
知識圖譜是透過將應用數學、圖形學、資訊視覺化技術、資訊科學等學科的理論與方法與計量學引文分析、共現分析等方法結合,並利用視覺化的圖譜形象地展示學科的核心結構、發展歷史、前沿領域以及整體知識架構達到多學科融合目的的現代理論。它能為學科研究提供切實的、有價值的參考。
本資源整理了知識圖譜相關開源工具,涉及知識圖譜公開資料集、知識圖譜儲存、視覺化、知識融合、知識圖譜計算相關工具、資料。
目錄
基礎架構
o圖資料庫工具
o三元組儲存工具
o圖計算框架工具
o圖視覺化工具
o圖處理相關程式語言
o圖應用服務工具
知識工程
o知識融合
知識圖譜資料集
o一般資料集
o語義網路資料集
o學術資料集
學習資料
o官方文件
o學習社群資料
基礎架構
圖資料庫工具
AgensGraph - multi-model graph database with SQL and Cypher support based on PostgreSQL
ArangoDB - highly available Multi-Model NoSQL database
Blazegraph - GPU accelerated graph database
Cayley - open source database written in Go
CosmosDB - cloud-based multi-model database with support for TinkerPop3
Dgraph - Fast, Transactional, Distributed Graph Database (open source, written in Go)
DSE Graph - Graph layer on top of DataStax Enterprise (Cassandra, SolR, Spark)
Grakn。AI - a distributed hyper-relational database for knowledge-oriented systems, i。e。 a distributed knowledge base
Graphd - the Metaweb/Freebase Graph Repository
JanusGraph - an open-source, distributed graph database with pluggable storage and indexing backends
Memgraph - High Performance, In-Memory, Transactional Graph Database
Neo4j - OLTP graph database
Sparksee - makes space and performance compatible with a small footprint and a fast analysis of large networks
Stardog - RDF graph database with OLTP and OLAP support
OrientDB - Distributed Multi-Model NoSQL Database with a Graph Database Engine
TigerGraph - a complete, distributed, parallel graph computing platform for enterprise, supporting web-scale data analytics in real-time。
Nebula Graph - A truly distributed, linear scalable, lightning-fast graph database, using SQL-like query language。
HugeGraph - An open source TinkerPop 3 compliant OLTP Graph Database with pluggable storage bakcend which is similar to JanusGraph。 It also supports OLAP through Spark GraphX。
三元組儲存工具
AllegroGraph - high-performance, persistent graph database that scales to billions of quads
Apache Jena - open source Java framework for building Semantic Web and Linked Data applications
Eclipse RDF4J - (formerly known as Sesame) is an open source Java framework for processing RDF data。 This includes parsing, storing, inferencing and querying of/over such data。 It offers an easy-to-use API that can be connected to all leading RDF storage solutions。 It allows you to connect with SPARQL endpoints and create applications that leverage the power of linked data and Semantic Web。
GraphDB - enterprise ready Semantic Graph Database, compliant with W3C Standards
Virtuoso - a “Data Junction Box” that drives enterprise and individual agility by deriving a Semantic Web of Linked Data from existing data silos
Hoply - explore bigger than RAM relational data in the comfort of Python。
圖計算框架工具
Apache Giraph - an iterative graph processing system built for high scalability
Apache TinkerPop - a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP)
Apache Spark - GraphX - Apache Spark‘s API for graphs and graph-parallel computation
Tencent Plato - a fast distributed graph computation and machine learning framework used by WeChat。
圖視覺化工具
AntV G6 - Simple, easy and complete high performance graph visualization engine written in JavaScript, from Ant Financial
Gephi - Graph visualization platform software runs on Windows, Mac and Linux。
KeyLines & ReGraph - Graph visualization tookits for JavaScript and React developer from Cambridge Intelligence。
Linkurious - Linkurious is an enterprise ready on-premises graph visualization and analysis platform。
圖處理相關程式語言
Cypher
Gremlin
SPARQL
GraphQL+- - The query language of Dgraph, which is based on Facebook’s GraphQL
GQL - An initiative to create a standard query language for property graph database, just like SQL for relational database。
圖應用服務工具
CosmosDB @ Microsoft - Azure Cosmos DB is Microsoft‘s globally distributed, multi-model (Key-value, Document, Column, Graph) database service。
JanusGraph @ IBM Compose
JanusGraph @ Google Cloud Platform - JanusGraph on Google Kubernetes Engine backed by Google Cloud Bigtable
JanusGraph @ Amazon Web Services Labs - The Amazon DynamoDB Storage Backend for JanusGraph
Neo4j @ Graphene
Neo4j @ Graph Story - End-to-end Graph Database hosting for Community and Enterprise Neo4j with expert help for development
Neptune @ Amazon Web Services - a fast, reliable, fully-managed graph database service that makes it easy to build and run applications that work with highly connected datasets
Graph Engine Service @ Huawei Cloud - Fully-managed, distributed, at-scale graph query and analysis service that provides a visualized interactive analytics platform。
Graph Database (beta) @ Aliyun (Alibaba Cloud) - highly reliable and available property graph database that supports ACID and TinkerPop Gremlin query language。
Tencent Knowledge Graph @ Tencent Cloud - One stop platform for Graph database, computing and visualization。 Currently available in beta test and only in Chinese。
知識工程
YAGA-NAGA - Harvesting, Searching, and Ranking Knowledge from the Web
知識融合
Dedupe - dedupe is a python library that uses machine learning to perform fuzzy matching, deduplication and entity resolution quickly on structured data。
LIMES - Link Discovery Framework for Metric Spaces。
知識圖譜資料集
一般資料集
BabelNet - Both a multilingual encyclopedic dictionary, with lexicographic and encyclopedic coverage of terms, and a semantic network which connects concepts and named entities in a very large network of semantic relations, made up of about 16 million entries, called Babel synsets。 Each Babel synset represents a given meaning and contains all the synonyms which express that meaning in a range of different languages。
Wikidata - Wikidata is a free, collaborative, multilingual, secondary database, collecting structured data to provide support for Wikipedia, Wikimedia Commons, the other wikis of the Wikimedia movement, and to anyone in the world。
Google Knowledge Graph - Google’s Knowledge Graph has millions of entries that describe real-world entities like people, places, and things。
DBpedia - DBpedia is a crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects。
XLore - A large-scale English-Chinese bilingual knowledge graph by structuring and integrating Chinese Wikipedia, English Wikipedia, French Wikipedia, and Baidu Baike。
The GDELT Project - The GDELT Project monitors the world’s broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, themes, sources, emotions, counts, quotes, images and events driving our global society every second of every day, creating a free open platform for computing on the entire world。
YAGO - A huge semantic knowledge base, derived from Wikipedia, WordNet and GeoNames。 Currently, YAGO has knowledge of more than 10 million entities (like persons, organizations, cities, etc。) and contains more than 120 million facts about these entities。 The source code of YAGO is in this Github repo。
Zhishi。me - Knowledge Graph data extracted from the largest Chinese encyclopedias, Baidu Baike, Hudong Baike and Chinese Wikipedia。
語義網路資料集
ConceptNet - ConceptNet is a freely-available semantic network, designed to help computers understand the meanings of words that people use。
Microsoft Concept Graph - For Short Text Understanding
OpenHowNet - An Open Sememe-based Lexical Knowledge Base in Chinese。
WordNet - A free large lexical database of English from Princeton University。
學術資料集
AMiner - Aminer aims to provide comprehensive search and mining services for researcher social networks。
Microsoft Academic - Microsoft Academic (MA) employs advances in machine learning, semantic inference and knowledge discovery to help you explore scholarly information in more powerful ways than ever before。
AceMap - Academic search engine based on knowledge graph which includes entities like paper, author, institution and etc。
學習資料
官方文件
Cypher - reference documentation
Gremlin - reference documentation
學習社群資料
Graph Book - TinkerPop3 centric book written by Kelvin R。 Lawrence
SQL2Gremlin - transition from SQL to Gremlin by Daniel Kuppitz
The Gremlin Compendium - minimum survival kit for any Gremlin user, 10 blog post series by Doan DuyHai
相關會議
Graph Connect - powered by Neo4j
Graph Day - an Independent Graph Conference from the Data Day folks