-
Notifications
You must be signed in to change notification settings - Fork 1
/
index_old.html
406 lines (374 loc) · 21.4 KB
/
index_old.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Cocoon</title>
<!-- Include Bootstrap CSS -->
<link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/bootstrap@4.5.3/dist/css/bootstrap.min.css"
integrity="sha384-TX8t27EcRE3e/ihU7zmQxVncDAy5uIKz4rEkgIXeMed4M0jlfIDPvg6uqKI2xXr2" crossorigin="anonymous">
<script src="https://code.jquery.com/jquery-3.5.1.slim.min.js"
integrity="sha384-DfXdz2htPH0lsSSs5nCTpuj/zy4C+OGpamoFVy38MVBnE+IbbVYUew+OrCXaRkfj"
crossorigin="anonymous"></script>
<script src="https://cdn.jsdelivr.net/npm/bootstrap@4.5.3/dist/js/bootstrap.bundle.min.js"
integrity="sha384-ho+j7jyWK8fNQe+A12Hb8AhRq26LrZ/JpcUGGOn+Y7RsweNrtN/tE3MoK7ZeZDyx"
crossorigin="anonymous"></script>
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.3/css/all.min.css">
</head>
<style>
.row {
display: flex;
flex-wrap: wrap;
}
.card {
height: 100%;
display: flex;
flex-direction: column;
}
.card-body {
flex-grow: 1;
}
</style>
<body>
<nav class="navbar navbar-expand-lg navbar-dark bg-dark">
<a class="navbar-brand" href="https://cocoon-data-transformation.github.io/page/">
Cocoon: Intelligence Layer for GenAI
</a>
<button class="navbar-toggler" type="button" data-toggle="collapse" data-target="#navbarNav"
aria-controls="navbarNav" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<div class="collapse navbar-collapse" id="navbarNav">
<ul class="navbar-nav ml-auto">
<li class="nav-item">
<a class="nav-link" href="https://cocoon-data-transformation.github.io/page/profile">Profile</a>
</li>
<li class="nav-item">
<a class="nav-link" href="https://cocoon-data-transformation.github.io/page/clean">Clean</a>
</li>
<li class="nav-item">
<a class="nav-link"
href="https://cocoon-data-transformation.github.io/page/standardize">Standardize</a>
</li>
<li class="nav-item">
<a class="nav-link" href="https://cocoon-data-transformation.github.io/page/lineage">Lineage</a>
</li>
<li class="nav-item">
<a class="nav-link" href="https://cocoon-data-transformation.github.io/page/model">Catalog</a>
</li>
<li class="nav-item">
<a class="nav-link" href="https://cocoon-data-transformation.github.io/page/transform">Transform</a>
</li>
<li class="nav-item">
<a class="nav-link" href="https://cocoon-data-transformation.github.io/page/genai">GenAI</a>
</li>
<li class="nav-item">
<a class="nav-link" href="https://cocoon-data-transformation.github.io/page/pipeline">Pipeline</a>
</li>
</ul>
</div>
</nav>
<section class="py-5 text-center container">
<div class="row py-lg-5">
<div class="col-lg-6 col-md-12 mx-auto pb-4">
<img src="https://github.com/Cocoon-Data-Transformation/page/blob/main/material/logo.png?raw=true"
alt="Logo" class="img-fluid mb-4">
</div>
<div class="col-lg-8 col-md-12 mx-auto pb-4">
<p class="lead text-body-secondary">
Cocoon is a cross-team platform for creating and managing the next-gen data catalog, transforming
large, messy enterprise data into GenAI-readable intelligence. </p>
</div>
<div class="col-lg-8 col-md-12 mx-auto">
<img src="https://github.com/Cocoon-Data-Transformation/page/blob/main/genai_gallery/genai2.png?raw=true"
class="p-4 img-fluid border border-3 rounded shadow">
</div>
</div>
</section>
<section class="py-5 bg-light">
<div class="container">
<div class="row py-lg-5 text-center">
<div class="col-lg-8 col-md-12 mx-auto">
<h2>AI is Hard Because Data is Hard</h2>
<p class="lead text-body-secondary">If Humans Can't Understand Enterprise Data, Neither Can AI</p>
</div>
<div class="col-lg-8 col-md-12 mx-auto mb-5">
<img src="https://github.com/Cocoon-Data-Transformation/page/blob/main/genai_gallery/genai1.png?raw=true"
class="p-4 img-fluid border border-3 rounded shadow">
</div>
<div class="col-lg-8 col-md-12 mx-auto mt-5">
<h2>Cocoon unlocks AI in 3 steps</h2>
</div>
</div>
<div class="row">
<div class="col-md-4 my-5">
<div class="card shadow">
<div class="card-header bg-dark text-white py-4 text-center">
<h4 class="mb-2">
<i class="fas fa-book-open" style="margin-right: 10px;"></i>
<span style="vertical-align: middle;">Catalog for GenAI</span>
</h4>
<p class="mb-0">Semantically describe your data</p>
</div>
<div class="card-body p-4">
<ul class="list-unstyled">
<li class="mb-4">
<h6 class="fw-bold">Detailed catalog in YML</h6>
<p class="text-muted small mb-0">
Describe your tables and files in detail. Summarize their relationships and connect them to the business.
</p>
</li>
<li class="mb-4">
<h6 class="fw-bold">Mostly in Natural Language</h6>
<p class="text-muted small mb-0">Easy for LLMs to use. Friendly for humans to understand, verify and maintain.</p>
</li>
<li class="mb-0">
<h6 class="fw-bold">Incorporate Enterprise Knowledge</h6>
<p class="text-muted small mb-0">Encompasses existing enterprise knowledge: DWs, lakes, documents, dbt models, human feedback, etc.</p>
</li>
</ul>
</div>
</div>
</div>
<div class="col-md-4 my-5">
<div class="card shadow">
<div class="card-header bg-dark text-white py-4 text-center">
<h4 class="mb-2">
<i class="fas fa-exclamation-triangle" style="margin-right: 10px;"></i>
<span style="vertical-align: middle;">Quality Monitoring</span>
</h4>
<p class="mb-0">Continuously identify and resolve issues</p>
</div>
<div class="card-body p-4">
<ul class="list-unstyled">
<li class="mb-4">
<h6 class="fw-bold">Semantic Data Quality</h6>
<p class="text-muted small mb-0">Profile data semantically. Alert issues about data ambiguity, semantic overlap, and business logics.</p>
</li>
<li class="mb-4">
<h6 class="fw-bold">Human-In-the-Loop</h6>
<p class="text-muted small mb-0">Seek clarification and verification from data providers. Explain the issues in natural language. </p>
</li>
<li class="mb-0">
<h6 class="fw-bold">Continuous Monitoring</h6>
<p class="text-muted small mb-0">Contextualize new data from the existing ones. Identify semantic inconsistency issues. </p>
</li>
</ul>
</div>
</div>
</div>
<div class="col-md-4 my-5">
<div class="card shadow">
<div class="card-header bg-dark text-white py-4 text-center">
<h4 class="mb-2">
<i class="fas fa-cogs" style="margin-right: 10px;"></i>
<span style="vertical-align: middle;">Data Preprocessing</span>
</h4>
<p class="mb-0">Shape your data, easy for online query</p>
</div>
<div class="card-body p-4">
<ul class="list-unstyled">
<li class="mb-4">
<h6 class="fw-bold">Comprehensive Modeling in SQL</h6>
<p class="text-muted small mb-0">
Automatically write SQL for data modeling (casting, expanding JSON, SCD, normalization, etc.)
</p>
</li>
<li class="mb-4">
<h6 class="fw-bold">
Data Pipelines in DBT
</h6>
<p class="text-muted small mb-0">Follow the existing best practices from DBT (CI/CD, incremental materialization, governance, etc.) </p>
</li>
<li class="mb-0">
<h6 class="fw-bold">Testing and Maintenance</h6>
<p class="text-muted small mb-0">
Automatically suggest tests based on semantic understanding. Propose fixes to broken pipelines.</p>
</li>
</ul>
</div>
</div>
</div>
</div>
</section>
<section class="py-5 text-center container">
<div class="container-fluid">
<div class="row justify-content-center py-lg-5">
<div class="col-lg-10 col-md-12 mx-auto pb-4">
<h2 class="fw-bold mb-4">Use cases: with Cocoon, Enterprise GenAI finally works</h2>
<div class="row">
<div class="col-md-6 my-5">
<div class="card shadow">
<div class="card-header bg-dark text-white py-4 text-center">
<i class="fas fa-lightbulb fa-3x mt-3"></i>
<h4 class="mb-2">
<span style="vertical-align: middle;">Unlock AI that failed before</span>
</h4>
<p class="mb-0">with catalog, serving as GPS for LLMs, enabling: </p>
</div>
<div class="card-body p-4">
<ul class="list-unstyled">
<li class="mb-4">
<h6 class="fw-bold text-dark">Data-informed customer-facing AI agents</h6>
</li>
<li class="mb-4">
<h6 class="fw-bold text-dark">Self-service data analytics & reporting</h6>
</li>
<li class="mb-4">
<h6 class="fw-bold text-dark">Business support functions’ co-pilots </h6>
</li>
<li class="mb-0">
<h6 class="fw-bold text-dark">Data-driven business processes optimization</h6>
</li>
</ul>
</div>
</div>
</div>
<div class="col-md-6 my-5">
<div class="card shadow">
<div class="card-header bg-dark text-white py-4 text-center">
<i class="fas fa-cogs fa-3x mt-3"></i>
<h4 class="mb-2">
<span style="vertical-align: middle;">10x AI, Data & Analytics teams</span>
</h4>
<p class="mb-0">with co-pilot automating data preparation tasks:</p>
</div>
<div class="card-body p-4">
<ul class="list-unstyled">
<li class="mb-4">
<h6 class="fw-bold text-dark">Org’s churn- & politics-resistant source of truth
</h6>
</li>
<li class="mb-4">
<h6 class="fw-bold text-dark">Discovery & understanding of new data sources
</h6>
</li>
<li class="mb-4">
<h6 class="fw-bold text-dark">Creating & maintaining data pipelines
</h6>
</li>
<li class="mb-0">
<h6 class="fw-bold text-dark">Lineage, quality & business issues identification
</h6>
</li>
</ul>
</div>
</div>
</div>
</div>
<h2 class="fw-bold mb-4">Watch our demo</h2>
<div class="embed-responsive embed-responsive-16by9">
<iframe class="embed-responsive-item" src="https://www.youtube.com/embed/xdmRXs0UnfE" allowfullscreen></iframe>
</div>
</div>
</div>
</div>
</section>
<section class="py-5 bg-light">
<div class="container">
<div class="text-center" >
<h2>Related Research</h2>
<p class="text-center lead text-body-secondary mb-3">Have questions? Contact me at <a
href="mailto:zh2408@columbia.edu">zh2408@columbia.edu</a></p>
<a href="https://github.com/Cocoon-Data-Transformation/cocoon" class="btn btn-dark btn-lg mb-5"
target="_blank">
<i class="fab fa-github mr-2"></i> Visit GitHub
</a>
</div>
<div class="row row-cols-1 row-cols-sm-2 row-cols-md-3 g-4">
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">Profile</h5>
<p class="card-text flex-grow-1">Semantically understand your table and identify anomalies.
</p>
<a href="https://cocoon-data-transformation.github.io/page/profile"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">Clean</h5>
<p class="card-text flex-grow-1">Clean, format and document your tables with SQL and YAML.
</p>
<a href="https://cocoon-data-transformation.github.io/page/clean"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">Standardize</h5>
<p class="card-text flex-grow-1">Standardize column values to given vocabularies.</p>
<a href="https://cocoon-data-transformation.github.io/page/standardize"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">Catalog</h5>
<p class="card-text flex-grow-1">Catalog your data warehouses into easy-to-understand forms.</p>
<a href="https://cocoon-data-transformation.github.io/page/model"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">Tranform</h5>
<p class="card-text flex-grow-1">Transform from source to target databases with
SQL.</p>
<a href="https://cocoon-data-transformation.github.io/page/transform"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">GenAI</h5>
<p class="card-text flex-grow-1">Build a semantic catalog to organize your data, ready for GenAI.</p>
<a href="https://cocoon-data-transformation.github.io/page/genai"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">Lineage</h5>
<p class="card-text flex-grow-1">Learn your data semantically from existing data pipelines.</p>
<a href="https://cocoon-data-transformation.github.io/page/lineage"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
<div class="col mb-5">
<div class="card shadow-sm h-100">
<div class="card-body d-flex flex-column">
<h5 class="card-title">Pipeline</h5>
<p class="card-text flex-grow-1">RAG large data pipeline using lineage</p>
<a href="https://cocoon-data-transformation.github.io/page/pipeline"
class="btn btn-secondary mt-auto">Learn More</a>
</div>
</div>
</div>
</div>
</div>
<!-- <div class="col-lg-8 col-md-12 mx-auto">
<div class="mt-4 text-center">
<a href="https://github.com/Cocoon-Data-Transformation/cocoon" class="btn btn-dark btn-lg"
target="_blank">
<i class="fab fa-github mr-2"></i> Visit GitHub
</a>
</div>
</div> -->
</section>
</body>
</html>