97 pointsby birdculture4 months ago14 comments
  • Animats4 months ago
    Overlap in multiple dimensions is simply the AND of overlap in each dimension.

    The 3D case comes up in collision detection.

    For collision detection in games, the objects are usually kept in a sorted order, with separate lists for X, Y and Z. Amusingly, a bubble sort is useful, because, as objects move, they tend to move locally, so a bubble sort quickly restores the order. The sorting algorithm should terminate quickly when there are few or no changes. First seen in I-Collide, 1995. When objects are moving slowly, speed is slightly worse than O(N), but degrades if there's too much motion.

    2D sorting speeds things up. If you sort the intervals by start X, start Y, you can process the intervals sequentially. Here's something of mine which does that.[1] A MySQL database does the sort, then feeds the data to this algorithm. Overlaps are detected, sets of overlapping objects are merged, and the sets of overlapping 2D rectangles are emitted. Sort is O(N log N) as usual, and overlap detection is O(N).

    [1] https://github.com/John-Nagle/maptools/blob/main/rust/src/ge...

    • efavdb4 months ago
      >>Overlap in multiple dimensions is simply the AND of overlap in each dimension.

      Presumably just for boxes aligned with the axes (or some other condition?)? EG two lines can have x’s in common and y’s but not overlap if they are sloped at some angle.

      • Animats4 months ago
        Right, axis-aligned bounding boxes.

        Most collision detection systems use axis-aligned bounding boxes as a filter. Then more detailed algorithms are used on possibly-colliding objects.

        • efavdb4 months ago
          Interesting, thanks!
    • animal5314 months ago
      Have you tried prefix-sum? In my game code I used the Nvidia key/offset sort for a long time, but I've since replaced that for all of my physics and spatial queries.

      There are cases where even though the sort executes more instructions that the size of the elements/code still fits into some Ln cache level and makes it faster, but in general the prefix approach comes out ahead.

    • 4 months ago
      undefined
  • srean4 months ago
    This seemingly no-brainer of a task becomes more intellectually interesting than what you may think on first contact (pun intended).

    More so when you have to distinguish between the different types of overlap and non overlap and carry through the reasoning over a chain of overlap/no-overlap relations. I sure underestimated it.

    The one dimensional case is covered(there you go again) by Allen algebra. The more richer notion is that of topological relations. I will find the Wikipedia pages and post.

    https://en.wikipedia.org/wiki/Allen%27s_interval_algebra

    https://en.wikipedia.org/wiki/Region_connection_calculus

    https://en.wikipedia.org/wiki/Spatial_relation

    https://en.wikipedia.org/w/index.php?title=DE-9IM

    Interval trees, range trees help if you have a large static set of interval like objects against which you have to relate a query object.

  • Galanwe4 months ago
    My work involves a lot of time series analysis, as such I'm often dealing with intervals.

    While the overlap algorithm (or rather "condition") is cute, there a lot more "cool" stuff to do with intervals, which I would have liked to see in there.

    - Checking whether multiple intervals overlap

    - Checking whether multiple intervals are contiguous

    - Merging contiguous intervals

    - Etc..

    From experience, something is also crucial when working with intervals: trivially knowing which boundaries are closed and which are opened. I found that defining a strict vocabulary helps a lot here. e.g. "last" is "inclusive", while "end" is exclusive.

    [closed; opened[ intervals are also the best when dealing with time intervals (if that makes sense in your use case), because you can trivially join them.

    • Terr_4 months ago
      Hmm. I imagine that determining which intervals can be picked to make a continuous span is really a graph-traversal algorithm.

      However you aren't just given all the existing edges (pair overlaps) in advance, maybe there's a way to have the graph-exploration side guide the edge-detection to minimize work.

    • ambicapter4 months ago
      You should write that blog post.
  • teddyh4 months ago
    An older, and IMHO slightly more authoritative, source: <https://wiki.c2.com/?TestIfDateRangesOverlap>

    See also: <https://martinfowler.com/eaaDev/Range.html>

    • rawling4 months ago
      > older

      Hey, _I'm_ a source that's older than that one: <https://stackoverflow.com/a/13513973>

      Not so sure about "more authoritative", though.

      • CaptainOfCoit4 months ago
        Bold to claim something you've authored on Stack Overflow is older than C2, the og wiki.

        The earliest version I could find on IA is from 2003 (https://web.archive.org/web/20030606033520/http://c2.com/cgi...), last edited in 2002 at that point, but wouldn't surprise me to page was initially created in the 90s.

        • rawling4 months ago
          Hah, damn. I just saw the date at the bottom and ignored the bit that said "last edit".
          • CaptainOfCoit4 months ago
            No worries :) Do go through that entire Wiki if you haven't before, somewhat of an gold mine of good (but old) programming tips and tricks. Some information is a bit dated at this point though, but I still think it's one of the best resources out there.
  • joshlk4 months ago
    R-Trees are a good data structure to use in this case, enabling you to query a collection of intervals for overlap with another in O(log(n)) time.

    Wikipedia: https://en.wikipedia.org/wiki/R-tree

  • senderista4 months ago
    I find it interesting that I don't have a good intuition for the simple condition; instead I have to follow something like the process in the article whenever I want to re-derive that condition.
    • tirutiru4 months ago
      I wonder how many completely u related applications have that interval check logic coded up somewhere. I'm pretty sure I wrote one for my work codebase. Would I bet my life that the < and <=s are correct? Nope.
      • Animats4 months ago
        That's what unit tests are for.
        • senderista4 months ago
          or better yet, property tests
  • penguin_booze3 months ago
    "Invert, always invert" -- Carl Jacobi
  • dekhn4 months ago
    I've written code like this to work with overlapping genes. While most genes exist in a genome with spacers between them and their neighboring genes, sometimes you get pairs of genes which overlap, and there seems to be some interesting biology that happens as a consequence.

    Here's a package for Python that presumably uses some sort index data structure to be efficient: https://pyranges.readthedocs.io/en/latest/

  • matu3ba4 months ago
    Nice introduction.

    1. Please always make closed and open interval explicit on all code examples. "Detecting overlap" is ambiguous and open intervals have no given solution in the article, if I'm not mistaken. 2. How do you define the empty interval on floating point numbers? How do you define an open interval on floating point numbers? Number representation, input range etc can be very important.

    Disclosure: Did some stupidly crazy time series eval for OCPP1.6 and OCPP2.01 charging profiles.

  • SyzygyRhythm4 months ago
    Even with the visualization, I found the minimal solution hard to visualize. I came up with this instead:

    Suppose you start with two separated intervals. The left one starts sliding rightward. At what point do they contact? That's easy, it's just when (end1 > start2).

    As it continues sliding, at what point do they lose contact? Again, easy: it's where (start1 >= end2).

    So the solution is the first condition and the negation of the second, i.e.: (end1 > start2) && (start1 < end2)

  • supportengineer4 months ago
    I had to do this with 1-day granularity in SQL so we created a Day dimension table and just did a join to detect the overlapping days.
  • fjfaase4 months ago
    Now write some code to manage collectons of intervals and operations, such as finding if a value is included in a collection of intervals and operations for merging two collections of intervals. What is the best data structure to be used? Explain why?
  • tim3334 months ago
    Spacetime overlaps can depend on the velocity of the observer if far enough apart. Not sure if it's worth figuring that into the code.
  • OptionOfT4 months ago
    I'm saving this for Advent Of Code 2025.
    • sour-taste4 months ago
      Overlapping intervals were a question back in 2023 so yeah, it's useful