feat: add read task in bigquery component #156

chuang8511 · 2024-06-10T18:12:27Z

Because

we want to read from data components

This commit

add read task to bigquery

linear · 2024-06-10T18:12:31Z

INS-4853 [Component] [BigQuery] create read task

chuang8511 · 2024-06-10T18:13:15Z

It is only for sync the task progress.
Will change PR status to open later.

donch1989 · 2024-06-12T01:33:39Z

data/bigquery/v0/main.go

+
+func constructTableColumns(myDataset *bigquery.Dataset, ctx context.Context, compConfig *base.ComponentConfig) ([]TableColumns, error) {
+	tableIT := myDataset.Tables(ctx)
+	tables := []TableColumns{}


How about we only fetch the table we want to use?

If we still want to fetch multiple tables, we can use a map with table_name as the key.

@donch1989
I was thinking to take the table name from setting parts to input.
Then, the users can choose the table / tables they want to extract with smart hint or drag-and-drop menu.
And, we also can adjust the columns without saving pipeline again if users want to change table.

What do you think about that?

Yeah, sounds good

I have created another ticket and leave TODO memo here.
I will deal with it in the near future.

donch1989 · 2024-06-17T00:08:52Z

data/bigquery/v0/insert.go

@@ -31,8 +32,10 @@ func insertDataToBigQuery(projectID, datasetID, tableName string, valueSaver Dat
 func getDataSaver(input *structpb.Struct, schema bigquery.Schema) (DataSaver, error) {
 	inputObj := input.GetFields()["data"].GetStructValue()
 	dataMap := map[string]bigquery.Value{}
+	transformer := base.InstillDynamicFormatTransformer{}


@chuang8511
I don't think we need to convert the case here. Since the schema is not defined in VDP, let's just keep it the same as it is in BigQuery.

@donch1989
I converted the schema from snake_case or camelCase to kebeb-case when we read the schema from BigQuery to save into VDP schema.

Here is the code

In this code, we did not modify the schema of BigQuery.
We modify the key to fetch the data from VDP.

Flow is like

Get schema from BigQuery

Change schema from snake_case / camelCase to kebab-case and save to VDP schema

user input data with the VDP schema, which is kebab-case.

Before we insert data into BigQuery, we fetch data with kebab-case from VDP components.

So, in the code you comment, we actually do not modify the schema in BigQuery.

Please correct me if I misunderstand.

Yeah, I know. But what if in BigQuery, there are two columns fooBar and foo_bar at the same time? The logic will go wrong.

@donch1989
So, do you mean that we do not have to transform BigQuery schema into kebab-case for VDP schema?
If so, I will revert this part & this code

Yes, I think we can just use the exact same names as those in BigQuery.

donch1989 · 2024-06-17T12:12:09Z

Please also help rebase it as well.

chuang8511 · 2024-06-17T16:19:17Z

@donch1989
Sync
Because there is a frontend bug, I have not done the final e2e test.
I will do the test after the Console bug is fixed.
Next time, I will put more time to try hardcode first to skip the bug.
Today, I happen to have a tight schedule, so I could not do it now.

Sorry...

chuang8511 · 2024-06-19T11:47:28Z

I have done end-to-end test.

Because - we want to read from data components This commit - add read task to bigquery

🤖 I have created a release *beep* *boop* --- ## [0.21.0-beta](v0.20.2-beta...v0.21.0-beta) (2024-07-02) ### Features * add mail component ([#178](#178)) ([04b19d0](04b19d0)) * add read task for gcs ([#155](#155)) ([77fe2fc](77fe2fc)) * add read task in bigquery component ([#156](#156)) ([4d2e7ec](4d2e7ec)) * **anthropic:** add Anthropic component ([#176](#176)) ([030881d](030881d)) * **anthropic:** add UsageHandler functions in anthropic ([#186](#186)) ([ebaa61f](ebaa61f)) * **compogen:** add extra section with --extraContents flag' ([#171](#171)) ([391bb98](391bb98)) * **instill:** remove extra-params field ([#188](#188)) ([b17ff73](b17ff73)) * **redis:** simplify the TLS configuration ([#194](#194)) ([0a8baf7](0a8baf7)) ### Bug Fixes * **all:** fix typos ([#174](#174)) ([cb3c2fb](cb3c2fb)) * **compogen:** wrong bracket direction in substitution ([#184](#184)) ([dfe8306](dfe8306)) * expose input and output for anthropic for instill credit ([#190](#190)) ([a36e876](a36e876)) * update doc ([#185](#185)) ([6e6639a](6e6639a)) --- This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

droplet-bot added the instill component label Jun 10, 2024

donch1989 reviewed Jun 12, 2024

View reviewed changes

chuang8511 mentioned this pull request Jun 13, 2024

feat: add read task for gcs #155

Merged

chuang8511 requested a review from donch1989 June 13, 2024 17:26

chuang8511 marked this pull request as ready for review June 13, 2024 17:26

chuang8511 requested review from pinglin, xiaofei-du and jvallesm as code owners June 13, 2024 17:26

donch1989 reviewed Jun 17, 2024

View reviewed changes

chuang8511 force-pushed the chunhao/ins-4853 branch from 0462093 to f343235 Compare June 17, 2024 16:00

chuang8511 added 15 commits June 19, 2024 11:57

feat: add read task in bigquery component

65d3d8c

feat: add functions to init dynamic schema

edf3751

feat: finish read task implementation in bigquery component

497abd2

chore: add todo for future development

9bd550c

chore: add description in document

a89daf6

feat: dynamically make kebab-case

6ec3ef9

feat: handle no table found case

9592d81

fix: fix lint error

cb2743b

fix: convert snake to kebab when searching input

9783aa1

chore: add comment

91cf866

fix: revert bigquery schema customization

acf8541

fix: fix conflict error

db4a90b

chore: take out the unnecessary parts

7f1503a

chore: update document

71fa8fe

fix: fix bug after recipe revamp

3f82148

chuang8511 force-pushed the chunhao/ins-4853 branch from f1574b4 to 3f82148 Compare June 19, 2024 11:47

chuang8511 requested a review from donch1989 June 19, 2024 12:02

donch1989 merged commit 4d2e7ec into main Jun 24, 2024
8 checks passed

donch1989 deleted the chunhao/ins-4853 branch June 24, 2024 02:58

droplet-bot mentioned this pull request Jun 24, 2024

chore(main): release 0.21.0-beta #173

Merged

namwoam pushed a commit to namwoam/component that referenced this pull request Jun 24, 2024

feat: add read task in bigquery component (instill-ai#156)

df903e1

Because - we want to read from data components This commit - add read task to bigquery

namwoam pushed a commit to namwoam/component that referenced this pull request Jun 24, 2024

feat: add read task in bigquery component (instill-ai#156)

fe97ad9

Because - we want to read from data components This commit - add read task to bigquery

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add read task in bigquery component #156

feat: add read task in bigquery component #156

chuang8511 commented Jun 10, 2024

linear bot commented Jun 10, 2024

chuang8511 commented Jun 10, 2024

donch1989 Jun 12, 2024

chuang8511 Jun 12, 2024

donch1989 Jun 12, 2024

chuang8511 Jun 12, 2024

donch1989 Jun 17, 2024

chuang8511 Jun 17, 2024 •

edited

Loading

donch1989 Jun 17, 2024

chuang8511 Jun 17, 2024 •

edited

Loading

donch1989 Jun 17, 2024

donch1989 commented Jun 17, 2024

chuang8511 commented Jun 17, 2024

chuang8511 commented Jun 19, 2024

feat: add read task in bigquery component #156

feat: add read task in bigquery component #156

Conversation

chuang8511 commented Jun 10, 2024

linear bot commented Jun 10, 2024

chuang8511 commented Jun 10, 2024

donch1989 Jun 12, 2024

Choose a reason for hiding this comment

chuang8511 Jun 12, 2024

Choose a reason for hiding this comment

donch1989 Jun 12, 2024

Choose a reason for hiding this comment

chuang8511 Jun 12, 2024

Choose a reason for hiding this comment

donch1989 Jun 17, 2024

Choose a reason for hiding this comment

chuang8511 Jun 17, 2024 • edited Loading

Choose a reason for hiding this comment

donch1989 Jun 17, 2024

Choose a reason for hiding this comment

chuang8511 Jun 17, 2024 • edited Loading

Choose a reason for hiding this comment

donch1989 Jun 17, 2024

Choose a reason for hiding this comment

donch1989 commented Jun 17, 2024

chuang8511 commented Jun 17, 2024

chuang8511 commented Jun 19, 2024

chuang8511 Jun 17, 2024 •

edited

Loading

chuang8511 Jun 17, 2024 •

edited

Loading