- [enhancement] Support column-based partition (thanks to Chi-Ruei Li)
- [enhancement] Enable object lifecycle management when creating buckets with
auto_create_gcs_bucket(thanks to @potato2003)
- [enhancement] Support location option for
auto_create_gcs_bucketoption (thanks to @potato2003)
- [enhancement] Support location option to allow to use 'asia-northeast1' region
- [enhancement] Support auth_method 'application_default'
- [maintenance] Fix deprecated warning log condition for
timeout_sec
- [maintenance] Support google-api-ruby-client >= v0.11.0
- [maintenance] Add
send_timeout_secandread_timeout_secoption for google-api-ruby-client >= v0.11.0
- [maintenance] Fix
schma_update_optionswas not set with load_from_gcs (thanks to h10a-bf)
- [maintenance] Fix
schema_update_optionswas not working (nil error)
- [enhancement] Support
schema_update_optionsoption
- [enhancement] Support partitioned table
- [maintenance] Add
progress_log_intervaloption to control the interval of showing progress log, and now showing progress log is off by default
- [maintenance] Fix Thread.new to use thread local variables to avoid nil idx error (thanks to @shyouhei and @umisora)
- [maintenance] if
is_skip_job_result_checkis true, skip output_rows checking (thanks to @joker1007)
- [enhancement] retry backendError and internalError in waiting load job
- [enhancement] retry Broken pipe and Connection reset in inserting object to GCS
- [new feature] Add
gcs_bucketoption to load multiple files from a GCS bucket with one load job
- [maintenance] Fix
private_keyauth is not working
- [new feature] Add
abort_on_erroroption - [maintenance] Use uuid instead of current time for temp_table name
- [new feature] Add
sdk_log_leveloption to show log of google-api-client - [maintenance] Fix
prevent_duplicate_insertwas not working correctly - [maintenance] Change to get
num_output_rowsoftransaction_reportfromget_tableAPI - [maintenance] Log response.statistics of load jobs
- [maintenance] Always create job_id on client side as google recommends so that duplication not to be occurred
- [maintenance] Fix a possibility which rehearsal would load 0 rows file
Big change is introduced. Now, embulk-output-bigquery is written in JRuby.
- [new feature] Support parallel loads. Fix #28.
- [new feature] Create table first. Fix #29.
- [new feature] Introduce rehearsal mode. Fix #30.
- [new feature] Support
dataset_oldoption forreplace_backup. Fix #31. - [maintenance] Fix default timestamp format to
%Y-%m-%d %H:%M:%S.%6. Fix #32. - [new feature] Support request options such as
timeout_sec,open_timeout_sec,retries. Fix #33. - [new feature] Support continuing from file generation with
skip_file_generationoption. - [new feature] Guess BigQuery schema from Embulk schema. Fix #1.
- [new feature] Support automatically create dataset.
- [new feature] Support transactional append mode.
- [incompatibility change] Formatter plugin support is dropped. Formatter is done in this plugin for specified
source_format. - [incompatibility change] Encoder plugin support is dropped. Encoding is done in this plugin for specified
compression. - [incompatibility change]
appendmode now expresses a transactional append, andappend_directis one which is not transactional (this wasappendmode before)
- [maintenance] Fix detect logic of delete_in_advance mode. #26. @sonots thanks!
- [new feature] Added template_table option. #25. @joker1007 thanks!
- [maintenance] Upgraded Embulk version to 0.8.1 #22. @joker1007 thanks!
- [maintenance] Formatted code style by checkstyle #23
- [new feature] Added mode parameters and support 4 modes(append, replace, replace_backup, delete_in_advance). #20 #21 @joker1007 thanks!
- [maintenance] Change error result display for easy investigation. #18
- [new feature] Added new auth method - json_keyfile of GCP(Google Cloud Platform)'s service account #17
- [maintenance] Upgraded Embulk version to 0.7.1
- [new feature] Supported mapreduce-executor. @frsyuki thanks! #13
- [maintenance] Fixed job_id generation logic #15
- [maintenance] Refactored #11
- [new feature] Added allow_quoted_newlines option #10
- [maintenance] Upgraded embulk version to 0.6.8
- [new feature] Added ignore_unknown_values option to job_id generation logic. #9
- [new feature] Added ignore_unknown_values option. #8 @takus thanks!
- [new feature] Added prevent_duplicate_insert option
- [new feature] Added new auth method - pre-defined access token of GCE(Google Compute Engine)
- [maintenance] Updated Google provided libraries
- http-client:google-http-client-jackson2 from 1.19.0 to 1.20.0
- apis:google-api-services-bigquery from v2-rev193-1.19.1 to v2-rev205-1.20.0
- [new feature] Changed bulk-load method from "via GCS" to direct-insert
- [new feature] added dynamic table creationg option