<img src="https://user-images.githubusercontent.com/57335825/88050180-373c5100-cb0b-11ea-9d75-d02303846f3b.png" alt="Pick of the Week" />
Normally the weekly issue covers Feature Explanation and Community Q&As. If something major happens, it will also be covered in the additional Events of the Week section.
<h2 id="post-events-of-the-week">Events of the Week</h2>
This <a href="https://docs.nebula-graph.io/manual-EN/5.appendix/sql-ngql/">Comparison Between SQL and nGQL</a> document will help you transition from SQL to nGQL and quickly get familiar with nGQL usage.
<img src="https://user-images.githubusercontent.com/57335825/88051041-d44bb980-cb0c-11ea-87bf-91fa4b36b084.png" alt="Comparison Between SQL and nGQL" />
<h2 id="post-feature-explanation">Feature Explanation</h2>
This week let's talk about the Job Manager feature.
There are some time-consuming tasks running at the storage layer of NebulaGraph. For such tasks, we provide Job Manager, a management tool for submitting, managing, and querying jobs.
Job Manager currently supports two job types: Compact and Flush. Compact is usually used to clear data that has been marked for deletion from the storage layer, and Flush writes the memfile in memory back to the hard disk.
Scenario #1: Submit RocksDB Compact and Flush Tasks.
To submit a RocksDB compact task, run the <code>SUBMIT JOB COMPACT;</code> command. A Job ID is returned, as shown in the following figure:
<img src="https://user-images.githubusercontent.com/57335825/88051211-1bd24580-cb0d-11ea-837b-126099dac26d.png" alt="SUBMIT JOB COMPACT" />
To write the RocksDB memfile in memory to the hard disk, run the <code>SUBMIT JOB FLUSH;</code> command. A Job ID is returned, as shown in the following figure:
<img src="https://user-images.githubusercontent.com/57335825/88051325-43291280-cb0d-11ea-81f2-e4bb56110b8e.png" alt="SUBMIT JOB FLUSH" />
Scenario #2: Query Tasks.
You can query either a single job or all jobs.
To list all jobs, run the <code>SHOW JOBS;</code> statement. All the job IDs are returned, as shown in the following figure:
<img src="https://user-images.githubusercontent.com/57335825/88051557-aadf5d80-cb0d-11ea-81bc-33f5dcd37bd4.png" alt="SHOW JOBS" />
In this job list, you can see information such as Job ID, Task ID, commands and the landing nodes.
After obtaining the ID of a specific job, you can use <code>SHOW JOB &lt;JOB ID&gt;;</code> to query its details. The job details include the Task ID of each task. Generally, each node running the storaged service has one Task ID, so the number of Task IDs depends on the number of storaged nodes.
<img src="https://user-images.githubusercontent.com/57335825/88051708-e11cdd00-cb0d-11ea-8c17-6e8cd2f7041b.png" alt="SHOW JOB" />
The command <code>STOP JOB &lt;JOB ID&gt;;</code> can be used to suspend ongoing tasks.
<img src="https://user-images.githubusercontent.com/57335825/88051801-175a5c80-cb0e-11ea-9ef5-f46efae74453.png" alt="STOP JOB" />
You can also use <code>RECOVER JOB;</code> to resume suspended tasks.
<img src="https://user-images.githubusercontent.com/57335825/88051878-3ce76600-cb0e-11ea-8563-9a5efcd6e79c.png" alt="RECOVER JOB" />
<h2 id="post-community-qa">Community Q&A</h2>
Q: The LOOKUP query is quite slow. What can I do to optimize the performance?
A: The slow query may be caused by unordered data. The LOOKUP statement relies on schema indexes, and indexes can retrieve data efficiently only when the data is in order. After importing a large amount of data, we recommend that you run a Compact operation to put the data in order, which improves LOOKUP query speed.
<h2 id="post-you-might-also-like">You Might Also Like</h2>
<ol>
<li><a href="https://nebula-graph.io/posts/nebula-graph-pick-of-the-week-jul-10-2020/">Pick of the Week 28 at NebulaGraph - Running Configuration Explained in Detail</a></li>
<li><a href="https://nebula-graph.io/posts/nebula-graph-pick-of-the-week-jul-24-2020/">Pick of the Week 30 at NebulaGraph - FETCH Syntax Goes Further with New Features</a></li>
</ol>

In this weekly issue, we are covering Job Manager and how to optimize the LOOKUP query in NebulaGraph. We have also prepared a guide for DBAs to compare SQL and nGQL.

Pick of the Week at NebulaGraph - SQL vs nGQL & Job Manager in NebulaGraph

As the Web3 ecosystem continues to expand, blockchain-based transactions have introduced a new paradigm of decentralized finance (DeFi) characterized by anonymity, intricate interaction paths, and a lack of centralized oversight. While this innovation offers unparalleled freedom and flexibility, it also creates significant challenges for risk management and compliance. **Traditional tools like rule engines, relational databases, and statistical analysis struggle to keep pace with the dynamic, hidden nature of Web3 threats. Graph databases, by contrast, are well suited to modeling and analyzing the complex relationships inherent in blockchain networks.**

In this article, we’ll explore why graph databases are becoming indispensable for Web3 risk management, how they address the shortcomings of traditional solutions, and their transformative impact on combating fraud, money laundering, and other illicit activities.

# The Challenges of Web3 Risk Management
## Anonymity and Lack of Context
In Web3, user identities are represented by blockchain addresses, which do not require real-name registration. This anonymity, combined with privacy-enhancing protocols and privacy coins, makes it nearly impossible to trace the origins of transactions or identify malicious actors. For example, attackers often exploit these features to launder money through multi-layered transfers or obfuscate transaction paths using time delays and decoy addresses. Traditional risk management systems, which rely on static rules and predefined thresholds, are ill-equipped to handle such sophisticated evasion tactics.

## Limitations of Rule Engines
Traditional rule-based systems operate on simple heuristics, such as flagging an address after a certain number of consecutive transactions or identifying high-value transfers as suspicious. While effective against predictable patterns, these rules falter when faced with the adaptive strategies of Web3 attackers. For instance, a money-laundering operation might split large sums into smaller amounts distributed across multiple addresses, making it difficult for static rules to detect the underlying scheme. Without the ability to understand the broader context of transactions, rule engines are rendered ineffective.

## Isolated Data Analysis
Statistical methods used in traditional risk management focus on individual metrics, such as transaction frequency or average transfer amounts. However, these approaches fail to capture the interconnected nature of Web3 transactions. Many fraudulent activities, such as Sybil attacks or airdrop farming, involve coordinated actions across multiple addresses controlled by a single entity. Analyzing each address in isolation misses the bigger picture, leaving organizations vulnerable to "group-based" attacks.

## Performance and Real-Time Constraints
The sheer scale and complexity of blockchain data pose additional challenges. Traditional relational databases rely on complex joins and queries to map relationships between entities, leading to slow performance and high resource consumption. In contrast, Web3 demands real-time risk detection and mitigation, which legacy systems cannot deliver.

# How Graph Databases Handle Web3 Risk Challenges
## Native Representation of Complex Relationships
At its core, Web3 operates as a vast network of interconnected entities: addresses, transactions, smart contracts, and protocols. Unlike relational databases, which require multiple tables and joins to represent these connections, graph databases model this data natively. Addresses become nodes, while transactions and interactions become edges, creating a semantically rich and intuitive representation of the blockchain ecosystem. This structure simplifies modeling and keeps relationship queries fast, even across billions of nodes and edges.

For example, tracking a fund flow from a high-risk address involves traversing multiple layers of transactions. A graph database can perform this task in milliseconds, identifying all downstream addresses within 10 hops of the original source. This capability is crucial for real-time risk assessment and response.
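The multi-hop traversal described above can be sketched in plain Python as a breadth-first search with a hop limit. This is only an illustration of the idea, not how NebulaGraph executes such a query; the function name `downstream_within`, the `(sender, receiver)` tuple format, and the sample addresses are all assumptions made for the example.

```python
from collections import deque


def downstream_within(transfers, source, max_hops):
    """Return {address: hop_count} for every address reachable from
    `source` by following transfers forward, within `max_hops` hops."""
    # Build an adjacency list: sender -> set of receivers.
    out_edges = {}
    for sender, receiver in transfers:
        out_edges.setdefault(sender, set()).add(receiver)

    hops = {source: 0}
    queue = deque([source])
    while queue:
        current = queue.popleft()
        if hops[current] == max_hops:
            continue  # do not expand beyond the hop limit
        for nxt in out_edges.get(current, ()):
            if nxt not in hops:  # first visit in BFS is the shortest path
                hops[nxt] = hops[current] + 1
                queue.append(nxt)

    del hops[source]  # report only downstream addresses
    return hops


# Hypothetical transfers; "risky" stands in for a flagged address.
transfers = [
    ("risky", "a"), ("a", "b"), ("b", "c"),
    ("risky", "d"), ("c", "risky"),
]
print(downstream_within(transfers, "risky", 2))
```

Because each address is recorded the first time it is reached, loops such as `c -> risky` do not cause repeated work, which matters when funds are deliberately cycled to obscure their path.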

## Pattern Recognition and Structural Analysis
Web3 fraud often exhibits distinct structural characteristics, such as circular transactions where funds loop back to their origin, star-shaped aggregations (multiple addresses funneling funds to a central account), or shared-source clusters (multiple users linked to the same IP). Graph databases excel at detecting these patterns through subgraph matching techniques, allowing systems to identify suspicious behaviors with remarkable accuracy.
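Two of the structures mentioned above, star-shaped aggregation and circular transactions, can be illustrated with a small Python sketch. A graph database would typically express these as subgraph-matching queries; the helper names `fan_in_hubs` and `find_cycle_through`, the thresholds, and the sample data below are assumptions for illustration only.

```python
def fan_in_hubs(transfers, min_senders):
    """Star-shaped aggregation: addresses that receive funds from at
    least `min_senders` distinct senders."""
    senders = {}
    for sender, receiver in transfers:
        senders.setdefault(receiver, set()).add(sender)
    return {addr: len(s) for addr, s in senders.items() if len(s) >= min_senders}


def find_cycle_through(transfers, start, max_len):
    """Circular transactions: return one path start -> ... -> start that
    uses at most `max_len` transfers, or None if there is none."""
    out_edges = {}
    for sender, receiver in transfers:
        out_edges.setdefault(sender, []).append(receiver)

    def dfs(node, path):
        for nxt in out_edges.get(node, []):
            if nxt == start:
                return path + [start]  # the loop closes back at the origin
            if nxt not in path and len(path) < max_len:
                found = dfs(nxt, path + [nxt])
                if found:
                    return found
        return None

    return dfs(start, [start])


# Hypothetical transfers: a three-step loop plus a collection hub.
transfers = [
    ("a", "b"), ("b", "c"), ("c", "a"),
    ("x", "hub"), ("y", "hub"), ("z", "hub"),
]
print(fan_in_hubs(transfers, 3))
print(find_cycle_through(transfers, "a", 5))
```

Bounding the cycle length keeps the search cheap and reflects the fact that very long loops are harder to attribute to a single scheme.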

By integrating with risk decision platforms, graph databases enable real-time alerts and dynamic analysis. As soon as an abnormal transaction or connection to a known blacklist node is detected, the system can trigger immediate warnings, far outpacing traditional batch-processing methods.

## Advanced Graph Algorithms for Fraud Detection
Graph databases come equipped with powerful algorithms that uncover hidden risks in large-scale networks. Take community detection and connected components analysis, for instance. Community detection techniques like Louvain clustering group highly interconnected addresses into potential fraud clusters. These clusters may represent orchestrated attacks, such as airdrop abuse or coordinated money laundering. Connected components analysis identifies isolated subgraphs, which helps detect rogue groups operating independently from the broader network. Such structures are often indicative of automated scripts or bot-controlled addresses.
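As a rough illustration of connected components analysis, the sketch below groups addresses with a union-find structure, treating every interaction as an undirected link. It is a minimal standalone example rather than the implementation used by any graph database, and the function name `connected_components` and the sample edges are assumptions. Louvain clustering is more involved and is not shown here.

```python
def connected_components(edges):
    """Group addresses into connected components, largest first.
    Each edge is an (address, address) pair treated as undirected."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    for a, b in edges:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b  # merge the two groups

    groups = {}
    for node in parent:
        groups.setdefault(find(node), set()).add(node)
    return sorted(groups.values(), key=len, reverse=True)


# Hypothetical interactions: one main group and one isolated pair.
edges = [("a", "b"), ("b", "c"), ("x", "y")]
print(connected_components(edges))
```

Small components that never touch the main network, such as the `x`/`y` pair here, are the kind of isolated subgraph the paragraph above describes as worth a closer look.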

These advanced graph algorithms allow graph databases to go beyond simple anomaly detection, providing deep insights into the structural vulnerabilities of Web3 ecosystems.

# The Future of Web3 Risk Management
As Web3 continues to reshape the financial landscape, the need for robust risk management solutions has never been greater. Graph databases offer a transformative approach to addressing the unique challenges posed by decentralized, anonymous, and complex blockchain environments. By enabling precise, real-time analysis of transaction networks, they empower organizations to stay ahead of evolving threats while ensuring regulatory compliance.

**For businesses operating in the Web3 space, adopting [graph database](https://www.nebula-graph.io/) technology is no longer optional—it’s essential.** Whether you’re a cryptocurrency exchange, a DeFi protocol, or a blockchain analytics firm, leveraging graph databases will help you unlock the full potential of your data and build a safer, more resilient Web3 ecosystem.


In the rapidly evolving world of Web3, where blockchain transactions are decentralized, anonymous, and complex, graph databases are emerging as the critical solution to tackle fraud, money laundering, and other risks with precision and speed.

Why Graph Models Outperform Traditional Tools in Web3 Risk Management

In the age of AI, competitive advantage hinges on how deeply organizations can understand their data. **With the release of NebulaGraph Enterprise v5.1, we're introducing native vector processing capabilities that address a critical demand in today’s AI and large language model development: hybrid queries combining graph structures and vector semantics.**

This enhancement significantly expands NebulaGraph Enterprise’s ability to handle unstructured data such as text, audio, and video—unlocking richer insights across domains like financial risk control and knowledge graph construction. The result is a more agile and intelligent way for enterprises to extract value from their data, backed by enterprise-grade reliability through distributed architecture improvements and cross-region disaster recovery.

# Native Vector Support Uncovers Hidden Connections  
In AI-driven applications, the gap between structured relationships and unstructured semantics often limits the depth of insight. With native vector support, NebulaGraph Enterprise v5.1 bridges this divide by integrating vector search into the graph model.

**Vectors are essentially becoming part of a node’s “attribute DNA.” This allows users to perform relationship traversal and vector similarity searches within the same query.** For example, entity relationships in an equity ownership chain can be analyzed alongside semantic intent extracted from contract texts, while sensor networks in industrial equipment graphs can be automatically linked to anomaly patterns in log messages. This unified approach opens up entirely new dimensions for data exploration and interpretation.
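To make the idea of a hybrid query concrete, here is a minimal Python sketch that first follows graph edges and then ranks the reached nodes by vector similarity. It does not use or represent the NebulaGraph Enterprise API; the function names `cosine` and `hybrid_query`, the data layout, and the two-dimensional embeddings are all assumptions chosen to keep the example small.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def hybrid_query(edges, embeddings, start, query_vec, top_k):
    """Step 1 (graph): collect the direct neighbours of `start`.
    Step 2 (vector): rank those neighbours by similarity to `query_vec`."""
    neighbours = {dst for src, dst in edges if src == start}
    scored = [
        (cosine(embeddings[n], query_vec), n)
        for n in neighbours
        if n in embeddings
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [n for _, n in scored[:top_k]]


# Hypothetical ownership edges and toy embeddings of contract text.
edges = [("holding", "sub_a"), ("holding", "sub_b"), ("other", "sub_c")]
embeddings = {
    "sub_a": [1.0, 0.0],
    "sub_b": [0.0, 1.0],
    "sub_c": [1.0, 0.0],
}
print(hybrid_query(edges, embeddings, "holding", [0.9, 0.1], 1))
```

Note that `sub_c` is semantically close to the query but is excluded because it is not connected to `holding`. The graph step constrains which candidates the vector step is allowed to rank.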

![image](https://www-cdn.nebula-graph.io/blogs/product_architecture_5.1.jpg)
*NebulaGraph Enterprise v5.1 Architecture*

# Up to 5.5x Performance Boost
On the performance front, NebulaGraph Enterprise v5.1 delivers major advancements. **Benchmarks based on the LDBC SF100 dataset show query throughput of up to 5.5 times that of the previous generation.** Complex deep-link queries see dramatic improvements. For instance, a 10-hop equity penetration query now runs in just 1.5 seconds, down from 8.2 seconds. Traversing intricate relationship networks at this speed enables faster, more responsive decision-making.

Additionally, full backup efficiency for ultra-large-scale graphs has been drastically improved. **A trillion-node dataset that previously required 72 hours can now be backed up in just 9 hours—greatly enhancing business continuity and operational resilience.**

# Enterprise-Grade Resilience and Security
To meet the stringent availability requirements of industries like finance and government, NebulaGraph Enterprise v5.1 introduces a robust disaster recovery and security framework.

The new cross-Zone active-active architecture leverages a multi-site, three-center design along with Raft consensus to ensure seamless traffic failover and zero data loss in case of local outages. For long-distance disaster recovery, manual switching and zone-aware scheduling minimize network latency and resource consumption—ensuring rapid service restoration even under extreme conditions.

On the security front, v5.1 offers end-to-end protection across access control, data transmission, and storage. Fine-grained permission management follows the principle of least privilege, allowing administrators to restrict access to specific subgraphs or attributes. Full TLS/mTLS encryption ensures secure data-in-transit, safeguarding sensitive information at every stage.

# Future-Ready Intelligence Infrastructure
With NebulaGraph Enterprise v5.1, the fusion of graph and vector technologies reaches a new level of maturity. By uniting structured relationships with unstructured semantics, and pairing that with breakthroughs in performance, security, and enterprise readiness, we’re empowering businesses across finance, manufacturing, healthcare, and beyond to build smarter, more responsive data infrastructures.

Ready to explore what the future of data intelligence looks like? [Contact us](https://www.nebula-graph.io/contact) to unlock the power of integrated graph and vector analytics.

