uBlock/platform
Raymond Hill c6fb70b1f0
Refactor hntrie to avoid the need for boundary cells
Whereas before the string segment was encoded as:

LL OOOOOOOOOOOO

where L is the upper 8 bits, used to encode the length
of the segment, and O is the lower 24 bits, used to
encode the offset of the string data in the character
buffer, the new code encodes it as follows:

OOOOOOOOOOOO LL

Furthermore, the most significant bit of the length
LL is now used to mark whether the current string segment
is a label boundary.
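
As an illustration of the new layout, here is a minimal sketch of
packing and unpacking one segment cell. It assumes a 32-bit cell with
the offset in the upper 24 bits, the length in the lower 7 bits, and
the boundary flag in bit 7. The names packSegment and unpackSegment
are hypothetical and are not taken from hntrie.js.

```javascript
// Hypothetical helpers illustrating the "OOOOOOOOOOOO LL" layout.
// Not the actual hntrie.js code.
const BOUNDARY_BIT = 0x80;  // most significant bit of the length byte
const LENGTH_MASK = 0x7F;   // remaining 7 bits hold the length
const MAX_OFFSET = 0xFFFFFF; // offset must fit in 24 bits

const packSegment = (offset, length, isBoundary) => {
    if ( offset < 0 || offset > MAX_OFFSET ) {
        throw new RangeError('offset does not fit in 24 bits');
    }
    if ( length < 0 || length > LENGTH_MASK ) {
        throw new RangeError('length does not fit in 7 bits');
    }
    const flag = isBoundary ? BOUNDARY_BIT : 0;
    // `>>> 0` converts the signed 32-bit result to an unsigned value.
    return ((offset << 8) | flag | length) >>> 0;
};

const unpackSegment = cell => ({
    offset: cell >>> 8,
    length: cell & LENGTH_MASK,
    isBoundary: (cell & BOUNDARY_BIT) !== 0,
});
```

With this layout a single mask tells whether a segment ends on a label
boundary, so no separate cell is needed to carry that information.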

This means a cell can't reference a segment longer than
127 characters. To work around this limitation when a
segment is longer than 127 characters (a rare occurrence),
the algorithm simply splits the segment into multiple
adjacent cells.
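
The splitting described above can be sketched as follows. This is a
simplified illustration, not the hntrie.js implementation: the
function name splitSegment is hypothetical, cells are returned as a
plain array rather than written into a shared buffer, and placing the
boundary flag only on the last chunk is an assumption.

```javascript
// Hypothetical sketch: split one long segment into adjacent cells,
// each referencing at most 127 characters. Not the actual hntrie.js code.
const MAX_SEGMENT_LENGTH = 0x7F;
const BOUNDARY_BIT = 0x80;

const splitSegment = (offset, length, isBoundary) => {
    const cells = [];
    let pos = offset;
    let remaining = length;
    while ( remaining > 0 ) {
        const n = Math.min(remaining, MAX_SEGMENT_LENGTH);
        remaining -= n;
        // Assumption: only the final chunk carries the boundary flag.
        const flag = remaining === 0 && isBoundary ? BOUNDARY_BIT : 0;
        cells.push(((pos << 8) | flag | n) >>> 0);
        pos += n;
    }
    return cells;
};

// A 300-character segment starting at offset 0 becomes three cells
// covering 127, 127 and 46 characters.
const cells = splitSegment(0, 300, true);
```

Segments of 127 characters or fewer yield a single cell, so the common
case costs nothing extra.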

As a result, there is no longer a need to encode
"boundariness" into special cells, which simplifies
both the storing and matching algorithms.

Additionally, added minimal documentation for the NPM
package on how to import and use HNTrieContainer as a
standalone API.
2021-08-10 09:27:59 -04:00
browser Group all compiling-related code into FilterCompiler() class 2021-08-04 15:14:48 -04:00
chromium Modularize codebase with export/import 2021-07-27 17:26:04 -04:00
common Fix jshint warnings 2021-08-03 12:08:59 -04:00
firefox Increase minimum version to Firefox 60.0 2021-08-08 10:47:04 -04:00
nodejs Refactor hntrie to avoid the need for boundary cells 2021-08-10 09:27:59 -04:00
opera Modularize codebase with export/import 2021-07-27 17:26:04 -04:00
safari Clarify that the code base was really old and obsolete 2019-09-22 09:57:38 -04:00
thunderbird Modularize codebase with export/import 2021-07-27 17:26:04 -04:00