chore: support addition between a timestamp and a timedelta by sycai · Pull Request #1369 · googleapis/python-bigquery-dataframes

sycai · 2025-02-06T03:25:04Z

No description provided.

tswast · 2025-02-06T20:52:24Z


    scalar_expr = bigframes_vendored.ibis.literal(literal)
-    if ibis_dtype:
+    if isinstance(literal, datetime.timedelta):


Should we allow pandas, numpy, and pyarrow scalars here, too?

Suggested change

-    if isinstance(literal, datetime.timedelta):
+    if isinstance(literal, (datetime.timedelta, numpy.timedelta64, pandas.Timedelta, pyarrow.DurationScalar)):

Sure! Pandas timedelta is a subclass of datetime.timedelta, but I can make it more explicit.

Added support for numpy.

I eventually decided not to implement pyarrow duration because:

It causes more unrelated failures than I expected in Python 3.9 and 3.10 environments, for some reason.

Pandas does not support adding pyarrow duration literals to a Series.

Maybe we can add support for pyarrow in the future, but at the moment it may not be worth it.
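
The subclass relationship mentioned in this reply can be checked directly. The snippet below is only a minimal sketch: `TIMEDELTA_TYPES` and `is_timedelta_literal` are hypothetical names chosen for illustration, not identifiers from the PR.

```python
import datetime

import numpy
import pandas

# Hypothetical tuple of accepted types, mirroring the check discussed above.
# pandas.Timedelta is listed explicitly even though it is already covered
# by datetime.timedelta, purely for readability.
TIMEDELTA_TYPES = (datetime.timedelta, pandas.Timedelta, numpy.timedelta64)


def is_timedelta_literal(literal: object) -> bool:
    """Return True for the timedelta-like scalars discussed in this thread."""
    return isinstance(literal, TIMEDELTA_TYPES)


# pandas.Timedelta subclasses datetime.timedelta; numpy.timedelta64 does not,
# which is why numpy needs its own entry in the tuple.
print(issubclass(pandas.Timedelta, datetime.timedelta))
print(issubclass(numpy.timedelta64, datetime.timedelta))
print(is_timedelta_literal(numpy.timedelta64(1, "s")))
```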

tswast · 2025-02-06T20:54:29Z

+def timedelta_to_micros(td: typing.Union[pd.Timedelta, datetime.timedelta]) -> int:
+    if isinstance(td, pd.Timedelta):
+        # td.value returns total nanoseconds.
+        return td.value // 1000


Looks like pandas.Timedelta has units: https://pandas.pydata.org/docs/reference/api/pandas.Timedelta.html

Please make this robust to various units.

That is for the constructor arg, but as per the docs, "The .value attribute is always in ns."

Right. Pandas always converts the input to values in nanoseconds, no matter what unit we use in the constructor.
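
To make the unit discussion concrete, here is a sketch of a converter along the lines of the diff above. The `pd.Timedelta` branch follows the quoted hunk; the `datetime.timedelta` and `numpy.timedelta64` branches are assumptions added for illustration and may differ from what the PR finally merged. NaT handling is left out.

```python
import datetime
import typing

import numpy as np
import pandas as pd


def timedelta_to_micros(
    td: typing.Union[pd.Timedelta, datetime.timedelta, np.timedelta64]
) -> int:
    # Check pd.Timedelta first: it is also a datetime.timedelta.
    if isinstance(td, pd.Timedelta):
        # .value is in nanoseconds regardless of the constructor unit.
        return td.value // 1000
    if isinstance(td, np.timedelta64):
        # Assumed branch: cast to microsecond resolution, then read the
        # underlying integer count.
        return int(td.astype("timedelta64[us]").astype(np.int64))
    if isinstance(td, datetime.timedelta):
        # Assumed branch: exact integer division by one microsecond.
        return td // datetime.timedelta(microseconds=1)
    raise TypeError(f"Unsupported timedelta type: {type(td)}")
```

Constructing the pandas value with different units (`unit="s"`, `unit="ms"`) goes through the same branch, since only the nanosecond `.value` is read.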

tswast · 2025-02-06T20:56:20Z

        return DATE_DTYPE
    if issubclass(type, datetime.time):
        return TIME_DTYPE
+    if issubclass(type, datetime.timedelta):


Likewise, do we need numpy, pandas, and pyarrow object detection here too?

Done. Left the PyArrow out for this one.
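
For the type-based lookup in this hunk, a rough sketch of the same idea follows. The string labels stand in for constants such as `DATE_DTYPE` and `TIME_DTYPE`, and `infer_dtype_label` is a hypothetical name; the real function's other branches are not shown in the diff.

```python
import datetime

import numpy


def infer_dtype_label(type_: type) -> str:
    """Map a Python scalar type to a placeholder dtype label."""
    # datetime.datetime subclasses datetime.date, so it must be tested first.
    if issubclass(type_, datetime.datetime):
        return "timestamp"
    if issubclass(type_, datetime.date):
        return "date"
    if issubclass(type_, datetime.time):
        return "time"
    # datetime.timedelta also matches subclasses such as pandas.Timedelta;
    # numpy.timedelta64 is unrelated by inheritance and is listed explicitly.
    if issubclass(type_, (datetime.timedelta, numpy.timedelta64)):
        return "timedelta"
    raise TypeError(f"No dtype label for {type_}")
```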


TrevorBergeron · 2025-02-07T17:58:45Z

+    if isinstance(
+        literal,
+        (datetime.timedelta, pd.Timedelta, numpy.timedelta64),
+    ):
+        # numpy timedelta is compatible with Ibis, so we process them separately.
+        return bigframes_vendored.ibis.literal(
+            utils.timedelta_to_micros(literal), ibis_dtype


Why don't we just replace literal expressions (as well as column defs in leaves) in the tree when we replace all the ops?

Good call. I moved the conversion to the rewrite module.

I am going to rename the module rewrite.operators to something like rewrite.timedelta_expression to better reflect its behavior, perhaps in another PR.
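
The idea of replacing timedelta literals during a tree rewrite can be sketched with a toy expression tree. Every class and function here (`Literal`, `Column`, `Op`, `rewrite_timedelta_literals`) is hypothetical and far simpler than the actual rewrite module; it only illustrates a bottom-up pass that swaps timedelta literals for integer microseconds.

```python
import dataclasses
import datetime
import typing


@dataclasses.dataclass(frozen=True)
class Literal:
    value: typing.Any


@dataclasses.dataclass(frozen=True)
class Column:
    name: str


@dataclasses.dataclass(frozen=True)
class Op:
    name: str
    inputs: typing.Tuple["Expr", ...]


Expr = typing.Union[Literal, Column, Op]


def rewrite_timedelta_literals(expr: Expr) -> Expr:
    """Return a copy of the tree with timedelta literals as integer micros."""
    if isinstance(expr, Literal) and isinstance(expr.value, datetime.timedelta):
        return Literal(expr.value // datetime.timedelta(microseconds=1))
    if isinstance(expr, Op):
        # Rewrite the children first, then rebuild the node around them.
        return Op(expr.name, tuple(rewrite_timedelta_literals(i) for i in expr.inputs))
    # Columns and non-timedelta literals pass through unchanged.
    return expr


tree = Op("add", (Column("ts"), Literal(datetime.timedelta(seconds=2))))
print(rewrite_timedelta_literals(tree))
```

Doing the conversion in one pass like this keeps the compiler itself free of timedelta-specific branches, which appears to be the motivation behind the reviewer's suggestion.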

TrevorBergeron · 2025-02-07T18:02:42Z

+@scalar_op_compiler.register_binary_op(ops.timestamp_add_op)
+def timestamp_add_op_impl(x: ibis_types.TimestampValue, y: ibis_types.IntegerValue):
+    return x + y.to_interval("us")


What SQL does this compile to? Is it a clean TIMESTAMP_ADD(timestamp_col, INTERVAL duration MICROSECOND)?

Yes: https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#timestamp_add

Though this has less to do with SQL syntax than with the ease of wiring the code. Ibis does support interval + timestamp and timestamp + interval, but that means I also need to check which operand is the integer here.

If I can standardize the order of types in the rewrite module, then I don't need to check ibis type here :)
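
The point about standardizing operand order can be illustrated without Ibis. The sketch below is not how Ibis or BigQuery DataFrames generates SQL; `Operand`, `order_operands`, and `render_timestamp_add` are hypothetical helpers that only show why a fixed (timestamp, micros) order removes the need for a type check at the point of use.

```python
import typing

# (kind, SQL text); kind is "timestamp" or "micros" in this toy model.
Operand = typing.Tuple[str, str]


def order_operands(left: Operand, right: Operand) -> typing.Tuple[Operand, Operand]:
    """Put the timestamp first so later code can assume a fixed order."""
    if left[0] == "micros" and right[0] == "timestamp":
        return right, left
    return left, right


def render_timestamp_add(left: Operand, right: Operand) -> str:
    ts, micros = order_operands(left, right)
    if ts[0] != "timestamp" or micros[0] != "micros":
        raise TypeError("expected one timestamp and one microsecond operand")
    # Follows the BigQuery TIMESTAMP_ADD(timestamp, INTERVAL n part) form.
    return f"TIMESTAMP_ADD({ts[1]}, INTERVAL {micros[1]} MICROSECOND)"


print(render_timestamp_add(("micros", "duration"), ("timestamp", "timestamp_col")))
```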

chore: support addition between a timestamp and a timedelta

eefffbc

product-auto-label bot added the size: l and api: bigquery labels Feb 6, 2025

sycai and others added 3 commits February 6, 2025 03:26

test_timestamp_dff

1f3a6a5

Merge branch 'main' into sycai_timestamp_add

af3809d

fix conftest.py

d089a35

sycai marked this pull request as ready for review February 6, 2025 17:48

sycai requested review from a team and tswast February 6, 2025 17:48

blunderbuss-gcf bot assigned jialuoo Feb 6, 2025

sycai requested review from TrevorBergeron and chelsea-lin February 6, 2025 17:48

Merge branch 'main' into sycai_timestamp_add

c79460d

tswast reviewed Feb 6, 2025


sycai and others added 5 commits February 7, 2025 00:12

support numpy and pyarrow timedelta literals

495ccb2

🦉 Updates from OwlBot post-processor

e83d936

See https://github.com/googleapis/repo-automation-bots/blob/main/packages/owl-bot/README.md

Merge branch 'main' into sycai_timestamp_add

1e51e63

fix format

fd38454

use local fixture for testing

c680948

sycai force-pushed the sycai_timestamp_add branch from 146c690 to c680948 Compare February 7, 2025 04:23

sycai added 3 commits February 7, 2025 17:16

Remove pyarrow duration scalar support.

fc760fd

fix format

3003398

remove redundant imports

b5b69cc

sycai force-pushed the sycai_timestamp_add branch from c9a5021 to b5b69cc Compare February 7, 2025 17:21

fix mypy

00e15db

TrevorBergeron reviewed Feb 7, 2025


update timedelta literals during tree rewrites

4efc943

sycai requested a review from tswast February 7, 2025 18:34

sycai requested a review from TrevorBergeron February 7, 2025 18:34

sycai and others added 5 commits February 7, 2025 19:41

update type conversions in tests to make py 3.9 happy

2b506b5

fix add operator for integers

6dbb790

Merge branch 'main' into sycai_timestamp_add

962c40d

Merge branch 'main' into sycai_timestamp_add

1029306

Merge branch 'main' into sycai_timestamp_add

1b23d9c

TrevorBergeron approved these changes Feb 10, 2025


sycai merged commit b598aa8 into main Feb 10, 2025

sycai deleted the sycai_timestamp_add branch February 10, 2025 23:40

sycai merged 20 commits into main from sycai_timestamp_add