assert: handle TokenTooLong error scenario #1559

arjunmahishi · 2024-03-03T11:16:30Z

Summary

As pointed out in #1525, when the assertion message contains a very long line, the message is truncated in the final output. This is because bufio.Scanner has a default maximum token size, MaxScanTokenSize, of 65536 bytes (64 * 1024). Scan() returns false as soon as the line being scanned does not fit within that limit, so the rest of the assertion message is dropped.

Changes

This commit fixes that by manually setting the internal scan buffer size to len(message) + 1 to make sure that the above scenario never occurs.

Related issues

Fixes #1525

arjunmahishi · 2024-03-07T05:44:17Z

@dolmen @brackendawson Can you please take one last look at this?
I think it's ready to be merged

arjunmahishi · 2024-03-08T15:41:23Z

@dolmen Are we good to merge?

dolmen · 2024-03-10T10:39:06Z

I'm not yet convinced that the tests provide enough coverage. I would like to see a clearer check of the bufio boundaries.

I feel that we are testing with strings much longer than the boundary, rather than just below and at the boundary.
I have to dig into the code more carefully.

dolmen · 2024-03-19T23:38:34Z

assert/assertions_priv_test.go

+		},
+		{
+			name:            "single line - just under the bufio default limit",
+			msg:             strings.Repeat("hello ", bufio.MaxScanTokenSize-10),


The boundary is not "hello" repeated bufio.MaxScanTokenSize times, but a string which has a line whose length is bufio.MaxScanTokenSize bytes.

We probably need a utility function to build such an input string. At least strings.Repeat alone doesn't help.

arjunmahishi · Mar 21, 2024

Ohh! That's right.

I changed the approach a little. The new commit generates the input as you suggested. Instead of matching against an exact output, it asserts the pattern of the output, such as the leading white space, because generating the expected output would mean rewriting the actual function logic in the test.
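A pattern-based check of this kind might look like the following sketch. Both functions are hypothetical: `indentLines` is a simplified stand-in for the function under test, and `checkIndentPattern` only inspects the line count and the prefix of each continuation line.

```go
package main

import (
	"fmt"
	"strings"
)

// indentLines is a hypothetical stand-in for the function under test: it
// prefixes every line after the first with indent.
func indentLines(message, indent string) string {
	return strings.ReplaceAll(message, "\n", "\n"+indent)
}

// checkIndentPattern verifies the shape of out rather than its exact content:
// the expected number of lines, and the indent on every line after the first.
func checkIndentPattern(out, indent string, wantLines int) error {
	lines := strings.Split(out, "\n")
	if len(lines) != wantLines {
		return fmt.Errorf("got %d lines, want %d", len(lines), wantLines)
	}
	for i, line := range lines[1:] {
		if !strings.HasPrefix(line, indent) {
			return fmt.Errorf("line %d lacks the expected indent", i+2)
		}
	}
	return nil
}

func main() {
	msg := strings.Repeat("a", 10) + "\n" + strings.Repeat("b", 10)
	fmt.Println(checkIndentPattern(indentLines(msg, "\t  \t"), "\t  \t", 2))
}
```

Checking the shape keeps the test independent of the implementation, at the cost of not catching corruption inside a line's content.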

brackendawson · 2024-04-02T15:35:50Z

assert/assertions.go

+	// than the length of the message to avoid exceeding the default
+	// MaxScanTokenSize while scanning lines. This can happen when there is a
+	// single very long line. Refer to issue #1525
+	msgScanner.Buffer(nil, len(message)+1)


Is this safe? As we know from the assert.Len case, the message can contain a very large formatted slice. We have already more than doubled the memory consumed by the large slice by creating a string containing its formatted version. Do we want to double the formatted version again, plus all the intermediate buffers that msgScanner allocated on the heap before the next GC cycle?

We could at least pre-allocate the buffer for msgScanner rather than passing in nil. But I'd prefer that we either split or truncate lines that are unreasonably long, or we could just truncate the entire message to less than bufio.MaxScanTokenSize?

arjunmahishi · Apr 4, 2024

> We could at least pre-allocate the buffer for msgScanner rather than passing in nil.

Agreed. I've made this change.

> But I'd prefer that we either split or truncate lines that are unreasonably long, or we could just truncate the entire message to less than bufio.MaxScanTokenSize?

If we decide to truncate, we could probably truncate based on the dimensions of the terminal? This makes more sense from a UX point of view.

brackendawson · Oct 3, 2024

Hey, @arjunmahishi. I really think we should truncate rather than allocate unknown amounts of memory. It turns out Equal actually already uses a function to truncate its values. Sorry to steal the issue but I've opened #1646 with how I think it could be done. What do you think?

arjunmahishi · Oct 8, 2024

Cool. Better to reuse the existing functionality and keep it consistent. Closing this now.

arjunmahishi requested a review from brackendawson March 3, 2024 11:16

arjunmahishi changed the title ~~assert: handle TokenTooLong error scenario while formatting assertion messages~~ assert: handle TokenTooLong error scenario Mar 3, 2024

assert: add comment that explains the MaxScanTokenSize handling logic

f37ae55

hendrywiranto reviewed Mar 3, 2024

assert: Fix typos in Test_indentMessageLines

48a148a

dolmen requested changes Mar 4, 2024

dolmen added the enhancement, pkg-assert, pkg-require, and bug labels and removed the enhancement and pkg-require labels Mar 4, 2024

assert: move Test_indentMessageLines into a separate file

f24d768

arjunmahishi requested a review from dolmen March 4, 2024 18:04

dolmen requested changes Mar 5, 2024

assert: make Test_indentMessageLines more deterministic

6b0dfad

Instead of using an arbitrary value like 20000, we can just use the value defined by bufio package (MaxScanTokenSize).

arjunmahishi requested a review from dolmen March 6, 2024 04:18

dolmen requested changes Mar 7, 2024

assert: indentMessageLines - build message with bytes instead of string

594ac7d

arjunmahishi requested a review from dolmen March 7, 2024 14:40

assert: add more test cases to Test_indentMessageLines

f341282

dolmen requested changes Mar 19, 2024

dolmen reviewed Mar 20, 2024

assert: pre-compute the indent spaces in indentMessageLines

b83d206

Also, fix the test cases for this function. This commit generates the input based on parameters like bytes per line and number of lines. The assertion is made against the pattern of the output rather than the exact output.

brackendawson reviewed Apr 2, 2024

assert: reduce memory allocations in indentMessageLines

9e31964

brackendawson mentioned this pull request Oct 3, 2024

Truncate very long objects in test failure messages #1646

Open

arjunmahishi closed this Oct 8, 2024

dolmen Mar 19, 2024

arjunmahishi Mar 21, 2024

brackendawson Apr 2, 2024

arjunmahishi Apr 4, 2024

brackendawson Oct 3, 2024

arjunmahishi Oct 8, 2024
