fix(bindings): correct indices for `Node::utf16_text` #4619

WillLillis · 2025-07-16T05:15:45Z

Node::utf16_text uses byte offsets to index into a slice of u16s. These indices need to be adjusted.

I added handling in case these adjusted indices happen to fall on a utf16 surrogate. I'm not sure how to construct such a pathological test case that has a node's text boundary split on a surrogate pair, but I think it's better to be safe than sorry in this case.

Alternatively, we could just use String::from_utf16, however this would return owned data (also making the requisite heap allocation), as well as break parity with Node::utf8_text's return type.

Closes utf16_text index out of range after parse_utf16_le in Rust #4616

ObserverOfTime · 2025-07-16T08:02:59Z

We (I?) normally use the bindings scope for the binding templates. I suggest using rust instead.
We should probably document these at some point…

lib/binding_rust/lib.rs

tree-sitter-ci-bot · 2025-08-02T20:04:10Z

Successfully created backport PR for release-0.25:

fix(bindings): correct indices for Node::utf16_text #4663

WillLillis force-pushed the utf16_range branch from 083f508 to 6709137 Compare July 16, 2025 12:38

ribru17 reviewed Jul 16, 2025

View reviewed changes

lib/binding_rust/lib.rs Outdated Show resolved Hide resolved

fix(rust): correct indices for Node::utf16_text

d5be325

WillLillis force-pushed the utf16_range branch from 6709137 to d5be325 Compare July 17, 2025 01:56

ribru17 approved these changes Jul 22, 2025

View reviewed changes

WillLillis added the ci:backport release-0.25 Backport label label Jul 26, 2025

clason approved these changes Aug 2, 2025

View reviewed changes

WillLillis merged commit d3c2fed into tree-sitter:master Aug 2, 2025
19 checks passed

WillLillis deleted the utf16_range branch August 2, 2025 20:03

tree-sitter-ci-bot bot mentioned this pull request Aug 2, 2025

fix(bindings): correct indices for Node::utf16_text #4663

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix(bindings): correct indices for `Node::utf16_text` #4619

fix(bindings): correct indices for `Node::utf16_text` #4619

Uh oh!

WillLillis commented Jul 16, 2025

Uh oh!

ObserverOfTime commented Jul 16, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

tree-sitter-ci-bot bot commented Aug 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

fix(bindings): correct indices for Node::utf16_text #4619

fix(bindings): correct indices for Node::utf16_text #4619

Uh oh!

Conversation

WillLillis commented Jul 16, 2025

Uh oh!

ObserverOfTime commented Jul 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tree-sitter-ci-bot bot commented Aug 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix(bindings): correct indices for `Node::utf16_text` #4619

fix(bindings): correct indices for `Node::utf16_text` #4619

ObserverOfTime commented Jul 16, 2025 •

edited

Loading