It's time to move your docs in the repo(www.dein.fr)

115 pointsby gregdoesit3 days ago20 comments

susam3 days ago
It is a bit weird to see LLMs suddenly being presented as the reason to follow what are basically long standing best practices.
'You must write docs. Docs must be in your repo. You must write tests. You must document your architecture. Etc. Etc.'
These were all best practices before LLMs existed and they remain so even now. I have been writing extensive documentation for all my software for something like twenty years now, whether it was for software I wrote for myself, for my tiny open source projects or for businesses. I will obviously continue to do so and it has nothing to do with:
> AI changes the game
The reason is simply that tests and documentation are useful to humans working on the codebase. They help people understand the system and maintain it over time. If these practices also benefit LLMs then that is certainly a bonus, but these practices were valuable long before LLMs existed and they remain valuable even now regardless of how AI may have changed the game.
It is also a bit funny that these considerations did not seem very common when the beneficiaries were fellow human collaborators, but are now being portrayed as very important once LLMs are involved. I'd argue that fellow humans and your future self deserved these considerations even more in the first place. Still, if LLMs are what finally motivate people to write good documentation and good tests, I suppose that is a good outcome since humans will end up benefiting from it too.
- seanwilson2 days ago
  > It is a bit weird to see LLMs suddenly being presented as the reason to follow what are basically long standing best practices.
  Maybe it's the speed of LLM iteration that makes the benefit more immediately obvious, vs seeing it unfold with a team of people over a longer time? It's almost like running a study?
  I have a similar reaction to strong static types being advocated to help LLMs understanding/debugging code, catching bugs, refactoring... when it's obvious to me this helps humans as well.
  Curious how "this practice helps LLMs be more productive" relates to studies that try to show this with human programmers, where running convincing human studies is really difficult. Besides problems with context sizes, are there best practices that help LLMs a lot but not humans?
  - codethief2 days ago
    Agreed. LLM usage just makes the impact more visible.
    I bet one of the next "revelations" is going to be: Avoid sprinkling side effects throughout your code. Prefer functional code, push side effects to the boundary (functional core, imperative shell).
- forrestthewoods2 days ago
  > It is a bit weird to see LLMs suddenly being presented as the reason to follow what are basically long standing best practices.
  About 95% of the work needed to make LLMs happy is just general purpose better engineering. Units tests? Integration tests? CI? API documentation? Good example? All great for humans too!
  I consider this largely a good thing. It would be much worse if the changes needed for Happy LLMs were completely different than what you want for Happy Humans! Even worse would be if they were mutually exclusive.
  It's a win. I'll take it.
  - hex-m2 days ago
    Reminds me of the "semantic web". Making content machine-readable has positive side effects. (clear structure, reduced ambiguity, separation of data and presentation)
    It's another example where the reason for better engineering is to make machines (search engines) happy.
- ronsor2 days ago
  AI means that you cannot defer software design until you've written half code; you cannot defer documentation to random notes at the end.
  It has the effect of finally forcing people to think about the software they're making, assuming they care about quality. If they didn't, then it's not practically different from an insecure low-code app or something copy-pasted from 15 year old StackOverflow answers.
  - lelanthran2 days ago
    > AI means that you cannot defer software design until you've written half code; you cannot defer documentation to random notes at the end.
    > It has the effect of finally forcing people to think about the software they're making,
    Ah, and all this time I was reliably assured that waterfall, design-upfront, was a broken process...
    kingstnap2 days ago
    Single iteration waterfall is a broken process. You really need those late stage usage feedback signals unless your requirements were somehow captured by God.
    lelanthran21 hours ago
    > Single iteration waterfall is a broken process. You really need those late stage usage feedback signals unless your requirements were somehow captured by God.
    Waterfall, as prescribed and as practiced, had feedback signals.
- palmotea2 days ago
  > It is also a bit funny that these considerations did not seem very common when the beneficiaries were fellow human collaborators, but are now being portrayed as very important once LLMs are involved. I'd argue that fellow humans and your future self deserved these considerations even more in the first place.
  The reason might have been a cultural regression. At least with documentation, it seems to have been much better and a bigger priority a couple decades ago.
  At the start of my career, teams produced documents as a part of their work, and there were even technical writers on staff. Then agile hit, the writers were laid off, much of what little documentation that was created was kept in various work-tracking systems and wikis that were periodically replaced, often with little to no migration.
- basilikum3 days ago
  > The reason is simply that tests and documentation are useful to other humans working on the codebase.
  Including future you
  - cindyllm3 days ago
    [dead]
- benatkin3 days ago
  Well, it's timely because there's a docs platform that has surged in popularity, and it really is not a good idea for most of those who need technical docs to be using a SaaS that approximates Squarespace.
- starkparker2 days ago
  As a tech writer, it's not surprising. All LLMs did was get PMs on the same page as TWs, devs, and support toward prioritizing it, because now it benefits feature development in the short term instead of the long term. They can put it on a quarterly review slide and in an OKR, and their bosses will jump up and down and squeal and give them a raise instead of shooting them in the face.
- jamie_ca2 days ago
  I see this as not just internal API/architecture/code documentation, but product documentation too. We maintain internal docs about how our product is used for our support, implementation, and sales teams to reference.
  Right now it's hosted externally (in our "blessed" knowledge base) but if it could be pulled into the repo, and we set an AI reviewer on every pull request to sanity check that if the changes we're doing have a material impact on the feature as described in those docs that it should be flagged (or changes proposed) that'd be a nice win for keeping them up-to-date, and it's easy enough to publish markdown as html or even script an update to the canonical site when we merge to main.
- sublinear2 days ago
  I agree and would go one step further. The way people are now talking to LLMs to write code is the way we need them to plan and discuss in meetings with humans.
  Everything regarding AI-assisted development is basically training wheels for the young people coming into the workplace.
- formerly_proven2 days ago
  There's a pattern where people create AI-specific infrastructure for coding agents which is essentially instantly obsolete because it's pointless. Stuff like most MCPs (instead of just using a CLI), agent-specific files (CLAUDE.md, AGENTS.MD, github-instructions.NET etc.) etc.
  > You should have a good, concise introduction to the codebase that allows anyone to write and test a simple patch in under 15 minutes.
  Yeah, that's the CONTRIBUTING file.
- j452 days ago
  LLMs are making it more possible to maintain.
- ErroneousBosh2 days ago
  > These were all best practices before LLMs existed and they remain so even now
  Okay, so what, should I be moving my docs out of the repo or something?
  How should I make it as hard as possible for LLMs to make any use of or suggestions about my documentation?
- pooploop642 days ago
  Lately I have seen a lot of things coming full circle like this in a way that always seems positive for humans as well.
  Many doomers are running around saying the future is grim because everything will be made for AI agents to use rather than humans. But so far everything done to push that agenda has looked more like a big de-enshittification.
  Another one is Model Context Protocol, which brings forth the cutting edge (for 1970) idea of using a standard text based interface so that separate programs can interoperate through it.
  If the cost of having non-user-hostile software is to let AI bros run around thinking they invented things like stdin and documentation, I'm all for it at this point.
  If any AI bros are reading this here's another idea. Web pages that use a mostly static layout and a simple structure would probably be a lot easier for AI to parse. And google, it would be really beneficial to AI agents if their web searches weren't being interfered with by clickjacking sites such as Pinterest.
- 8note2 days ago
  there's an implicit ownership change, from having technical writers own the documentation, to including it as part of the commit.
  when things are tiny or resource constrained, the same people are doing each task regardless, but "technical writer" is a real job around documentation and manual writing, so there's at least sometimes some real decoupling between the code and outwards facing documents.
  that also covers for cases where people can write code well, but who's english(or whatever the target documentation language is) is shaky at best.
- 3 days ago
  undefined
- 2 days ago
  undefined
redgridtactical3 days ago
The biggest win for me with docs-in-repo isn't the AI angle, it's that pull requests can't land without updating the relevant docs. When your support pages, privacy policy, and README all live in the same repo, they naturally stay in sync with the code.
GitHub Pages serving directly from a /docs folder makes it even simpler, no separate deploy, no separate CMS, no drift. The less infrastructure between writing and publishing, the more likely docs actually get maintained.
themanmaran3 days ago
We just did this the other week and it's such a great setup using AI. Monorepos in general are better for coding agents since it's a single location to search. But now we have the ability to say "Add xyz optional param to our API" and claude adds the code + updates the documentation. I was also able to quickly ask "look at our API and our docs, find anything out of date".
Our set up is:
```
  packages/

  ↳ server

  ↳ app

  ↳ docs
```
Using mintlify for the docs, just points to the markdown files in the docs folder. And then a line in the claude.md to always check /docs for updates after adding new code.
- dheera3 days ago
  The one thing I hate about monorepos is nothing ever gets versioned, packaged, and shipped.
  Polyrepos are workable, the way to do it is to actually version, ship, and document every subcomponent. When I mean ship, I really mean ship, as in a .deb package or python wheel with a version number, not a commit hash. AI can work with this as well, as long as it has access to the docs (which can also be AI-generated).
  - giorgioz3 days ago
    The monorepo make it easier to ship the overall product but harder to ship parts of it. I've used a monorepo for the past 13 years and I got all shared packages with version 0.0.0 and I still haven't figured out a simple way to share just some parts of it like a CLI. Does anyone have a monorepo and publishes NPM packages with source code of only that folder? Sub-gits required to pull in multiple places...
  - dezgeg2 days ago
    The best thing about monorepos is nothing ever gets versioned and packaged.
    That means, a subcomponent can just make a needed change in the supercomponent as well, and test and the ship the subcomponent without excess ceremonies and releases.
  - fixxation923 days ago
    I've got about ~15 repos for a project and I just start Claude Code in the parent directory of all of them, so it has clear visibility everything and cross-reference whatever it needs.... super handy.
- giorgioz3 days ago
  Yes it's awesome! I'm creating a lot of CLIs with Claude Code to interact with external services. Yesterday made a CLI for the Google Search Console so I can prompt "get all problems from indexing in Google Search Console and fix them". Same with Sentry bugs. Same with the customer support "Use the the customer support cli skill to get recent conversations from customers and rank bug reporting and features requests and suggest things to work on"
  - serial_dev3 days ago
    Sentry MCP is great, “find out top 10 issues by users affected, check what it would take to fix and if you think it’s a low risk fix, apply it. Open a PR that links to the issues and explain the issue and the fix in three sentences max”.
- 8note2 days ago
  couldnt the docs be a build output rather than a dedicated folder? keep the docs close to the code they document?
prepend3 days ago
That time was like 10 years ago. I think it’s been best practice to have docs in the repo for a long time.
GitHub Pages came out in 2008.
- bryanlarsen3 days ago
  The best time to plant a tree was 30 years ago. The second best time is now.
  - prependa day ago
    Isn’t the second best time 29 years and 364 days?
    I get the principle but it seems like there are many other times to plant than now.
  - 9rx2 days ago
    The best time to move your docs to your repo was 30 years ago. But now that they are written by LLMs, tomorrow's LLM will be able to write an even better doc than today's LLM. Nothing is gained in caching them now.
    bryanlarsen2 days ago
    If that's true, you've got the wrong stuff in your docs. Capture the why's. LLM's can synthesize the what's and how's.
    9rx2 days ago
    Unless you're doing something wrong, the "why" is already captured in your test suite/type system. While you can fairly call that documentation (that is the point of it!), the linked story is about natural language documentation. That can be extracted from your tests/types at will, and as models keep getting better extracting later will be better than extracting now.
    bryanlarsen2 days ago
    That's great, if you only want to see the trees. Views of the forest are important too.
    9rx2 days ago
    Don't get me wrong, I'm sure future historians appreciate when you document that John, when trying to determine how to figure out how to exchange data in your application, took a drink from his Diet Coke that happened to have a fly in it and as he spit it out eureka struck: Data can be pushed to the client like a firehose.
    But let's be real, that's never going to happen. No matter what format you give someone, they're not going to write that kind of thing down. They will, however, encode why a firehose approach is necessary to both document it for future readers and to ensure that the program doesn't accidentally (or possibly on purpose by an eager junior dev) move to, say, a pull method that won't meet the business/technical requirements. Which LLMs can extract a natural language version from.
    And, really, that's the only "why" that actually matters to other developers trying to get their job done. The forest, while perhaps full of fun stories that I am sure are entertaining to read, doesn't matter all that much.
  - rdmuser3 days ago
    It's a good saying but the literal meaning is not entirely correct anymore. Climate change has changed the math on tree planting in a few ways. For example tree planting in your area today may backfire vs 30 years ago: https://www.scientificamerican.com/article/forest-preservati...
- WillAdams2 days ago
  1984 was a bit more than 10 years ago:
  http://literateprogramming.com/
  c.f.,
  https://news.ycombinator.com/item?id=47300747
- 8note2 days ago
  if you aren't using github because you're at a big company with its own git UIs, you still have to make the case that your company needs this thing.
  I'm sure there's a ton of places where its been hard to do that in the past in a way where the docs are easily accessible where people are looking
  but with agents, even just pulling the code package to get the docs is fine
  - prependa day ago
    I use Jekyll and other builders independent of GitHub. So even weird companies with other git repos can still autogenerate docs.
- giorgioz3 days ago
  that's true. Take care because in the YCombinator there is "Don't be snarky". Ask yourself how you could have provided the same useful insight without being snarky: https://news.ycombinator.com/newsguidelines.html
  - satvikpendem3 days ago
    I don't see anything snarky about their comment. That rule is for cases where people are overly sarcastic and argumentative, not for comments like the above.
    giorgioz2 days ago
    snarky critical or mocking in an indirect or sarcastic way.
    I think the constructive information is mentioning Github Pages were built on .md file and they have existed for 10 years. That's true and useful.
    The snarky part happens when phrasing the sentence "Ah you guys only doing this now? It has existed for 10 years" that part is talking about something constructive but it's also trying to diminish the discovery of someone else. We all find and develop at different times. In general is better to be constructive and happy someone else is joining you in using something than underline how it was already done in some way by you or others and this new discovery is irrelevant. Everything that exists already existed (in some other shape or smaller parts).
    prependa day ago
    I disagree. Sometimes it’s useful to point out dumb things. As long as it’s not cruel. I think being efficient in speech is helpful and no need to be constructive with silly or damaging comments.
    I don’t think this is a case of just finding something late, it’s finding something decades after it’s very common. And strange that the author didn’t reference this old practice.
  - prependa day ago
    I don’t think my comment is any snarkier than yours.
    My intent wasn’t snark, but more incredulity at how the post was written and how it states some very old, commonly accepted best practice was novel or finally time to use.
    I’d make a similar comment if someone posted that now is the time to use object oriented programming. And I wouldn’t do so in a snarky way.
- mixmastamyk3 days ago
  Python Sphinx as well.
xixixao3 days ago
We have been on this path at work. But I challenge everyone to consider what you lose with MD vs Confluence (et al). It is NOT easier to author, comment on, label, view history of, move without breaking links, etc. markdown docs vs Confluence. If I am the sole author plus my AI and the scope is narrow (a library), I go for MD. But for a big org, process docs, fast iteration… I’m not convinced, until someone builds equally powerful editing UI on top of MD files.
petcat3 days ago
Out-of-band docs have always been a constant source of frustration and discrepancies. It's really difficult to keep readme.com docs updated with actual code releases because there's no hard constraint preventing one from updating without the other. It just relies on "convention".
- thangalin3 days ago
  > difficult to keep [...] docs updated with actual code
  I used my software and R Markdown documents to help address such problems. In the source code, you have:
  // DOC SNIPPET BEGAN: example_api_usage /** */ function amazing_function( char life, long universe, string everything ) { } // DOC SNIPPET ENDED
  In the R Markdown you write an R function to parse all snippets, then refer to snippets by name. If the snippet can't be found, building the documentation fails, and noisily breaks a CI/CD pipeline.
  What's nice is that you can then use this to parse C++ definitions into Markdown tables to render nicely formatted content.
  The general idea is that you can have "living" documentation reference source code and break on mismatch. Whether you use knitr/pandoc or python or KeenWrite/R Markdown[1] is an implementation detail.
  [1]: https://keenwrite.com/
  - moritz2 days ago
    In the Elixir ecosystem (where documentation is considered a "first-class citizen" in the language), you can run code examples as part of your test suite in a similar fashion ("doctest"): https://elixir-recipes.github.io/testing/doctests/
    thangalina day ago
    > documentation is considered a "first-class citizen"
    How exquisitely Knuthian!
theletterf3 days ago
There’s an irresistible, almost demoralizing irony in the fact that developers are discovering docs and accessibility only now due to AI. They needed docs and didn’t know it until they had at their disposal an ersatz user in the form of an LLM that asked for context.
https://passo.uno/skills-are-docs/
- dsjoerg3 days ago
  Strongly agreed. However, some developers have trouble writing clearly and reading lots of text, and therefore prefer oral and interactive + real-time transmission of the information. Those developers, I suppose and hope, are discovering that they can talk out loud to their agents, explain everything interactively, and then the agent can create whatever longer-term artifact it wants to record the understanding. Multi-modal interfaces FTW?
- jiggawatts2 days ago
  When one of the top Anthropic people said something along the lines of “Our users that invested heavily into their documentation are the best positioned to reap the benefits of AI tooling” I just laughed and laughed, then gazed upon the bare plains of comment-free spaghetti code written by developers who don’t even work here any more.
- AlexeyBelov2 days ago
  At $job and $previousJob we (the devs) were never given time to properly keep the documentation up-to-date. It didn't matter that people were asking the same questions and discovering the same things again and again.
  Now, at $job, there is a top-down directive for quickly documenting every part of every important workflow and every idiosyncrasy of our products.
  So, developers knew that all along.
sigbottle3 days ago
There's a lot of things that we mean when we say 'docs'.
The great talk "No Vibes Allowed" put me to the far end of the other extreme - persistent long term state on disk is bad. Always force agents to rebuild, aggressively sub agent or use tools to compress context. The code should be self documenting as much as possible and structured in a way such that it's easy to grep through it. No inline docs trying to describe the structure of the tree (okay, maybe like, 3 at most).
I don't have the time to build such an elaborate testing harness as they do though. So instead I check in a markdown jungle in ROOT/docs/* . And garbage collect them aggressively. Most of these are not "look for where the code is", they are plans of varying length, ADRs, bug reports, etc. and they all can and *will" get GC'ed.
I still use persistent docs but they're very spare and often completely contractual. "Yes, I can enumerate the exact 97 cases I need to support, and we are tracking each of these in a markdown doc". That is fine IMO. Not "here let me explain what this code does". Or even ADRs - I love ADRs, but at least for my use case, I've thrown out the project and rewritten from scratch when too many of them got cluttered up... Lol.
I'm also re-implementing an open source project (with the intent of genuinely making it better as a daily user, licensed under the same license, and not just clean rooming it), which makes markdown spam less appealing to me. I kind of wish there was yet another git wrapper like jujutsu which easily layered and kept commits unified on the same branch but had multi-level purposes like this. Persistent History for some things is not needed, but git as a wrapper for everything is so convenient. Maybe I just submodule the notes....
Note: my approach isn't the best, heck, 1 month ago OpenAI wrote an article on harness engineering where they had many parallel agents working, including some which aggressively garbage collected. They garbage collected in the sense that yes, prolific docs point agents to places XYZ, but if something goes out of date, sync the docs. Again, That works if you have a huge compute basin. But for my use cases, my approach is how I combatted markdown spam.
- 01HNNWZ0MV43FF2 days ago
  ADR = "Architecture Decision Record" https://github.com/joelparkerhenderson/architecture-decision...
- philipp-gayret2 days ago
  One of the better ways to maintain docs I've seen is with tests that let you describe what the inputs and outputs were for an API, and from it the framework generated your docs. (This was Spring Rest Docs) We included aggressive checks to have every input and output tested, it meant we had one truth about what fields existed: The code was aligned with the tests, and the tests were also the docs. I really liked this idea; Just one record of the truth. Granted it doesn't capture the intent of the code perfectly, but it solves a lot of the garbage collection.
jmward013 days ago
When I start a new project with a team I start off with asking 'how we will work' and part of that is 'how we will communicate'. Less is more in that world. Jira, confluence, github, slack, email, standup, ad-hock meetings, bongo drums, etc etc. The more places you communicate the harder it is to keep everyone on the same page. I have always been a fan of putting docs next to code for this exact reason and, as far as I can tell, it has been the right decisions every time.
With AI code assistants I personally spend 90% of time/tokens on design and understanding and that means creating docs that represent the feature and the changes needed to implement it so I can really see the value growing over time to this approach. Software engineering is evolving to be less about writing the code and more about designing the system and this is supporting that trend.
In the end I don't think AI hasn't fundamentally changed the benefit/detractor equation, it is just emphasizing that docs are part of the code and making it more obvious that putting them in the code is generally pretty beneficial.
GeoSys2 days ago
For open source, this has been the practice for many projects. The docs is often in README.md or in a separate folder "docs". For larger projects, there could be a separate repo from where a docs site is built.
However, in corporations, docs are often in Confluence or MS Sharepoint, separate from the code. Tech specs often require comments, discussion, or estimate/budget approvals from non-tech staff. Hence, some corporate AI coding tools can refer to docs in such corporate sites. That doesn't work too well yet, IMHO ... time will show.
gbro3n3 days ago
Bit of a plug I suppose, but this was what motivated me to set up AS Notes, my VS code extension which makes VS Code a personal knowledge management system, with linking and markdown tooling. I've built an html converter so they can be published to github pages from the repo. It's here if it's of interest to anyone https://www.appsoftware.com/blog/as-notes-turn-vs-code-into-... ... I'm so much more motivated to write docs when a) its easy to keep them up to date using an agent, and b) someone (agents) will actually read them!
codethief2 days ago
> Answer to objections
> Non-engineers usually don't have repo access. [Answer:] (1) You can deploy your docs on an internal-only website. (2) There is clear trend with non-engineer code access (which poses some interesting security challenges).
Regarding (2): If, on the other hand, you have your agent use MCP to query, e.g., Confluence, anyone with access to Confluence could in theory do a prompt injection and possibly get access to your repo.
At least doc changes in the repo will undergo code review.
r2vcap2 days ago
Interesting idea overall, and I would support doing this if we can.
Some constraints are:
- Non-programmers are not used to working with Git.
- In practice, they (usually PMs or feature designers) need to write their documents somewhere else.
Possible solutions are:
- Make non-programmers use Git as a documentation tool (upgrade your tooling or GTFO).
- Build a two-way sync tool so that programmers and non-programmers can work from the same source.
```
  - However, in practice, an SSOT (single source of truth) architecture is usually much simpler. Two-way sync tends to be quite difficult, especially across different platforms.
```
cborenstein2 days ago
Agree with the post. Working on https://spectagon.md which aims to improve the workflow around reviewing docs in the repo.
Reviewing docs in Github isn't great - as the post mentions, Google Docs or similar is typically where review happens. Spectagon aims to change that so that you can submit PRs for docs and you get a great review experience as a layer on top of Github.
whatever13 days ago
More importantly move your docs from anything else to pure markdown. Finally we are free from weird file formats and superfluous syntax for docs.
- hrmtst938373 days ago
  Pure markdown is fine until you need decent tables or structured metadata. Docs-in-repo sounds clean on paper, but the minute you need comments, suggestions, inline edits, permissions, and approvals from people who do not live in git all day, you are recreating half of Notion or Google Docs with plugins and glue code.
  Then you ask marketing or support to open a PR. That is usually where the markdown honeymoon ends.
  - kevin_thibedeau2 days ago
    ReST delivers most of what Markdown can't.
odie55333 days ago
What about a OneDrive folder shared with all developers, mounted in a place the AI can access? Putting docs in git makes it slow to iterate and share. That's my hesitancy with committing them.
alansaber3 days ago
Not sure I agree with this. MD files need to be constantly synced to code state- why not just grep the code files? This is just more unstructured indexing
- twelve403 days ago
  yeah my teammates seem to enjoy checking in endless walls of MD texts of "documentation" generated by llms after it's done adding a feature. So even if that's an extreme and your documentation is more thoughtful, there is still a problem of:
  * redundancy with the code: if code samples can be generated from the code, why bother duplicating them? what do they add? can they not be llm-generated later? and possibly kept somewhere out of the way (like, a website) so as not to clutter the codebase with redundancy
  * if you do go for this duplication, then you are on the hook for ensuring it's always up-to-date otherwise it becomes worse than duplicate: misleading
  So my preference is, when adding something to the repo, think very hard whether this information is redundant or not. Handcrafted docs, notes, comments that add more context like why was this built that way after a ton of deliberation - yes. Anything that is trivially derived from the code itself - no.
  - MoreQARespect3 days ago
    I've been trying to push people to use hitchstory or similar to generate docs from specification tests precisely to avoid that redundancy but most people just look blankly at it and go "why don't you just do that with AI?"
- gbro3n2 days ago
  The code doesn't always say "why".
- nulltrace2 days ago
  Grepping works when you wrote the code. Not so much when someone else installs your package and has no idea which export is public API. We added a one-page markdown saying "use these, ignore the rest" and the wrong-import issues mostly stopped.
jmclnx3 days ago
Sounds like they are saying use a repo like git for your documents to help AI read/"understand" your docs. Is that correct ?
I am all for using a source control system for your documents, I usually use RCS. But give AI access to your docs, no thanks. If I upload any of my docs to a public server (very rarely happens), they are compressed and encrypted to make sure only I and a few people can view them.
- rikroots3 days ago
  For me it's a case of, I have to expose my canvas library documentation for the training data bots to find and (hopefully) include in the LLM training data because it's the only way I'll ever get LLMs to:
  A) accept that my library exists, and has its uses (it's a tough world out there for canvas-focussed JS libraries that aren't Fabric.js, Konva.js or Pixi.js)
  B) learn how to write code using my library in the best way possible (because the vibes ain't going away, so may as well teach the Agents how to do the work correctly)
  Plus, writing the documentation[1] for a library I've been developing for over 10 years has turned into a useful brain-dumping activity to help justify all the decisions I've made along the way (such as my approach to the scene graph). I'm not going to be here forever, so might as well document as much as I can remember now.
  [1] - https://scrawl-v8.rikweb.org.uk/docs/reference/index.html
- ssgodderidge3 days ago
  > Just like code should be primarily written for humans to read, all files in a repository is written primarily for humans to review
  The author at least acknowledges the point of files is to be read by humans.
  Also the article is talking specifically about public docs mean to be used by others, not ones you’re specifically trying to keep private
- gitaarik2 days ago
  So the code you'll share with an AI, but not the docs?
globular-toast2 days ago
Wait, who didn't have the docs in the repo? Where else would it go?
- iainmerrick2 days ago
  There are two main options, put your docs in the repo, or throw them all over the floor. Many companies opt for the floor.
- xorcist2 days ago
  Is the Git Book part of the git repo?
  Is the Linux Doc Projec part of the kernel?
  No. For good reasons. The only people who insists all doc must live in the same repo as the code are the ones who does not value documentation.
  Note, that in both examples above there is a documentation in the main repo, but not all documentation lives there.
jaredcwhite3 days ago
"because of AI" is not a valid reason to change anything about how developer communities & projects are managed.
- gitaarik2 days ago
  But if you have an AI-based development environment it can have it's influence