Rust at Scale: An Added Layer of Security for WhatsApp(engineering.fb.com)

265 pointsby ubj11 days ago16 comments

cong-or10 days ago
The 160k → 90k LOC reduction is nice, but the parallel rollout is the more interesting part. Running Rust alongside the C++ version and using differential fuzzing to check equivalence is a lot more realistic than “rewrite and pray.” You get incremental validation with the old system as a fallback. Curious how long they ran both before cutting over.
Binary size is a real concern on the client side. On servers the Rust stdlib overhead usually doesn’t matter, but when you’re shipping to billions of mobile devices, every KB counts. Good to see they invested in build tooling instead of just accepting the bloat.
- galangalalgol10 days ago
  Did they say anywhere what they did? Rebuilding the stdlib as part of your build can shrink it a lot depending on how much of it you use, but that is still nightly only. Maybe they went no_std or created their own?
  - surajrmal10 days ago
    They didn't but keep in mind that the app is currently 170MiB. The standard library shouldn't have added more than a few hundred kilobytes. They already likely pay similar costs for c++, but it's more worthwhile as they have a lot more c++ code total.
    Also note that if you statically link to the rust std library, lto will excise the majority of it anyways, no need to rebuild it.
    galangalalgol10 days ago
    The default hello world stripped with one codegen unit and panic=abort was 342kB both nightly and stable. Adding lto dropped it 42kB in stable and 40kB in nightly. Adding build-std and only building core did not reduce it any further in size.
    metaltyphoon10 days ago
    I assume OP is taking about using -Zbuild-std on nightly. This will drop it much more.
    galangalalgol10 days ago
    That is what I was talking about. It didn't reduce it at all over just lto. If I'd set optimization to z it probably would have gotten some back, but that starts impacting performance
  - jsgf9 days ago
    This is built with buck, and core/std are built as a source dependency.
erithax10 days ago
> We believe that this is the largest rollout globally of any library written in Rust.
I think that crown currently goes to https://github.com/googlefonts/fontations which is included in Chromium, not sure if it's on all platforms yet. Moreover, the translative dependencies of Fontations (click through https://crates.io/crates/fontations/0.3.0/dependencies) should have an even (slightly) larger install-base.
EDIT: from the quote you can also gather that they don't use https://github.com/signalapp/libsignal
- mdriley10 days ago
  Just a few more Rust libraries we've shipped in Chromium:
  - https://github.com/image-rs/image-png
  - https://github.com/webmproject/CrabbyAvif
  - https://github.com/RCasatta/qr_code
  - https://github.com/unicode-org/icu4x
- dcsommer10 days ago
  Just for reference, Wamedia ships on the major Meta apps and on iOS, Android, Desktop, and Web platforms.
londons_explore10 days ago
> over 3 billion people to message securely each and every day.
Whatsapp is a chat application with 3 billion daily active users.
For those of you in the US (where Whatsapp is seldom used), this is a fact worth remembering.
If you want to build products for the rest of the world, you need to know how those users think and breathe - and for 3 billion of them, Whatsapp is how they talk.
- jraph10 days ago
  What one should do about this? I mean, beside working on lowering that number.
  (Asking as a European who quite stubbornly refuses to install it - there are dozens of us. Dozens!)
  Edit: please don't participate in making WhatsApp even more inescapable as it is today.
  - harikb10 days ago
    As a developer, I tried building an app that needs to use Whatsapp for communication. Unfortunately my phone number got blocked by the second test message. No Spam. Not marketing, just a test message to my own number. Along with it, they blocked my entire business, my LLC, and anything tied to it.
    I have been trying to get hold of anyone or anything at Whatsapp. I've spent 6 months trying to navigate the bureaucracy. Facebook support claims they can't touch WhatsApp; WhatsApp support ignores the Facebook side. If you're building on WA, have a backup plan.
    If any Whatsapp employee reading this can look into my WBA Account 1117362643780814
    morpheuskafka10 days ago
    The number is only checked at login, and after that you can now create a WebAuthn passkey (iCloud Keychain/Google Passwords synced to your next phone) for future sign-ins so it's actually only needed for first sign up. So just get a prepaid SIM or eSIM and make another account unless your business is so large that tons of people know your number.
    harikb9 days ago
    Sorry I am confused. I have a "WhatsApp Business Account", tied to an "Business" (verifications all done). What I am talking about is registering a phone number that acts as the "Sender/Responder" of the messages from my customers. I am not trying to use WhatsApp from my phone manually, but have my app communicate with my customers programatically. Hope this is clear.
    I can't do any of the above,
    1. Requesting a new test number. Test numbers are placeholder 555 number that works only within WhatsApp test network. Can't get one.
    2. Registering a new, real phone number (SIM obtained from a regular tele provider)
    3. Disconnecting the WhatsApp product from the Facebook App to reset the integration.
    Although the FB app is being used, I don't have any WhatsAppp users (because I have not even made the product), so wiping out any WBA accounts and starting fresh is also okay, if someone can do this.
    rvnx10 days ago
    Telegram API is easier to handle as far as I know if that can somehow help (in case you want live ChatGPT or notifications for yourself in a mobile chat)
    duskwuff10 days ago
    Telegram's bot API is a lot easier to get started with for sure. It's got some rough edges once you start trying to do anything more complex, though, and the underlying MTProto API is nothing short of bizarre.
    I'd urge caution before using them as a component of your business, though. Their business strategy is pretty chaotic and has relied heavily on weird cryptocurrency-adjacent plays (e.g. TON / Fragment / gifts). They've made a couple of attempts to introduce business features, but I'm not sure they've had any substantial uptake.
    morpheuskafka10 days ago
    Yeah, which is ironic given that it is not E2EE (unless specifically opted in for a private chat, and even then some would argue the MTProto crypto isn't good enough, although those people wouldn't trust WhatsApp ether). WhatsApp is overwhelming associated with legitimate (though in many countries, primarily overseas) users, and Telegram is somewhat associated with shady activities.
    That said, Telegram is likely a lot more open for a business type that is legal but still regulated or illegal in some countries (legalized/unregulated substances, tobacco/e-cigarettes, adult content, etc.), probably less worried of random bans/demonetization.
    Despite not being E2EE, Telegram also seems to have higher usage in censored countries (Russia and Iran etc). Once a Russian guy in Korea randomly asked if I had Telegram wanting me to take a picture for him since his phone was dead -- obviously had no idea that sounded like a massive scam flag to most Western users.
    harikb10 days ago
    I will look into it. But my user base is either WhatsApp or plain SMS text messaging.
    yandie10 days ago
    Yeah telegram is so easy to develop with - I was blown away. I was able to spin up a bot that checks for GE appointments with minimal effort.
    imtringued8 days ago
    You're supposed to go to a local WhatsApp partner instead of contacting WhatsApp directly if you want to get API access for sending messages.
    https://business.facebook.com/messaging/partner-showcase
  - embedding-shape10 days ago
    I guess if you want to lower that number, you'd need to build something better, in some way. Answered as another European who've had Whatsapp forever, as some stubborn people refuse to move away from it, and also bunch of businesses use it.
    01HNNWZ0MV43FF10 days ago
    Network effect is killer. "better" would include having more than 3 billion people already on it.
    Maybe the EU or China will crack down on it. A single company shouldn't decide who gets to talk to half the world. If that company is American they will not tolerate it for long.
    Personally DeltaChat is my new favorite Thing but it falls afoul of Zooko's Triangle - A WhatsApp number or POTS number is short because it's centrally controlled and you have to pay for each one. DeltaChat has public keys, so I have 20 of them, and nobody can control who gets one, but they're incredibly long... the QR codes are nightmares.
    embedding-shape10 days ago
    > Network effect is killer. "better" would include having more than 3 billion people already on it.
    At one point people moved from something else to Whatsapp, and that happened before Whatsapp had 3 billion people on it. If it's good, early adopters will adopt it and want others to adopt it too, then it snowballs from there.
    It has happened before, and as long as new regulation doesn't solidify Whatsapp/FB in their position, it can happen again :)
    riffraff10 days ago
    WhatsApp happened at a time when, in Europe, you paid for SMS.
    WhatsApp allowed people to send SMS without paying, or rather, paying just once to buy the app, so it was instantly valuable if you just convinced your spouse or parents or a single friend to install it.
    To overcome it now, you need a lot more effort (or rely on enshittification, which I'm sure will happen).
    embedding-shape10 days ago
    No, before Whatsapp, people were mostly using Facebook messages, at least where I lived at the time.
    And no one was paying per SMS at the time we were using SMS for communication, almost everyone I know were on monthly plans that gave you N text messages and N minutes of calls for static sum each month.
    The first people I saw who started using whatsapp, was people who were communicating across the border, because even if you had a monthly plan, those didn't include international messages. Eventually we all converged on whatsapp because that's what outside family and relatives used anyways.
    vlovich12310 days ago
    WhatsApp launched in January of 2009 compared with Facebook Chat which launched in 2008. WhatsApp saw drastically wider adoption among the general populace and paying for “N text messages per month” is precisely what people refer to as paying per message - WhatsApp had unlimited messaging.
    embedding-shape10 days ago
    Is "Facebook Chat" not the same as "Facebook Messenger", the separate chat client? Because I seem to remember a lot of people using the chat built-in into Facebook (not Messenger) a lot earlier than the standalone app/client, maybe I misrecall.
    > paying for “N text messages per month” is precisely what people refer to as paying per message
    Maybe I said it wrong, "N text messages per month" for me means "Pay us 10 EUR per month, send up to 5000 messages" for example. Doesn't matter how many you send, you pay the same.
    While "pay per message" is "Every text message you send, costs 0.01 EUR". Maybe I'm using the wrong words, but that's how I understand it.
    Most of the people who were "texters" (in my circles) were on plans offering the first way of paying, while hardly anyone was doing it the second.
    Another important part, was that most telecom's had free SMS and calls if you were with the same company (and still do, AFAIK), so constant bickering about what plan people are on and why they don't change so it's free and yadda yadda.
    Many people were already mostly texting for free at this point.
    vlovich12310 days ago
    Facebook chat preceded Messenger which was a rebranding and separating into a standalone app precisely because WhatsApp ate their lunch so bad.
    The rates people were paying back then were extortionate - like 60-90% profit margin. When WhatsApp launched, plans were 5-15 euros/month for 100-500 messages with ~0.15 per message for overages. So you might not count the bundle as a per text message, but it really is which you can tell by what happens if you send more than your bundle allowed. Compare that with WhatsApp’s $1/year for unlimited messaging and you start to see the pricing disparity.
    Many people were not mostly texting free in 2009. I think you’ve got the timelines mixed up. That started changing towards the mid to late 2010s precisely because of internet-based chat apps on the phone and plummeting data costs making the telco’s SMS pricing plans insane.
    embedding-shape10 days ago
    Let me preface this with that my experience comes from Sweden in the 90s and 00s, and is a correct and truthful lived experience of my life. Seemingly, things were different were you lived, and that's fine, but that's not how it worked all across Europe, so at least we can agree on that :)
    The initial claim of "WhatsApp happened at a time when, in Europe, you paid for SMS." maybe was true in parts of Europe, but clearly not everywhere. People were mostly using the Facebook chat (not Facebook Messenger/Chat) already before Whatsapp started being used, although Whatsapp in Sweden still isn't as popular as in other countries. In Spain, everyone uses Whatsapp, in Sweden, seemingly the people I talk to only have Whatsapp to communicate with me and others outside the country.
    > Many people were not mostly texting free in 2009
    Most people I knew definitively were mostly texting for free even before 2009, again, at least in Sweden.
    vlovich1239 days ago
    I think we can agree that Sweden is not a representative sample of what happened in Europe as a way to explain why WhatsApp became dominant for the majority of people in Europe.
    I grew up in Canada so my knowledge is purely from talking with people in non-Swedish parts of Europe that I met and also reading contemporary articles analyzing the space as well as retrospective analysis of what led to WhatsApp’s popularity and dominance.
    stavros10 days ago
    The EU has already forced WhatsApp to be interoperable. Of course, Meta complied maliciously, making it a setting that you have to enable, but at least it's a start.
    embedding-shape10 days ago
    I guess the bean counters figured it'd be cheaper compared to ultimately paying the fine they get for maliciously following the rules. Hope the fine ends up large enough to make them wrong :)
  - londons_explore10 days ago
    Make your customer support on whatsapp. "Drop us a message to change your order". Allow ordering/enquiries over whatsapp.
    Send 2 factor verification pins over whatsapp - it is more reliable than SMS and generally there is a better 1:1 mapping between whatsapp accounts and real humans than phone numbers, so it is a good anti-spam or good way to distribute "first month free" type deals whilst keeping abuse low.
    Obviously make sure all URL's have info cards properly rendered in Whatsapp for good share-ability.
    jraph10 days ago
    And now your customers are required to agree to Meta's term of services and to run some black box software, and you are screwed if Meta decides your business or your customers need to be kicked out.
    sieabahlpark10 days ago
    [dead]
  - mfashby10 days ago
    Force interoperability one way or another. WhatsApp is a closed system, if I want to use an alternative I'm stuck with adversarial interoperability, so stuff like Beeper (which is great, but...) which might get my account banned. Or waiting for some legislation to force WhatsApp to open it's API and let me interact with my contacts there without being locked into their apps
    darrenf10 days ago
    There is legislation in the EU, and BirdyChat announced compatibility.
    https://www.birdy.chat/blog/first-to-interoperate-with-whats...
    jraph10 days ago
    BirdyChat, the existence of which we all first became aware at the same time as that legislation and which nobody can use yet, only join a waitlist... :-)
    aaravchen7 days ago
    And apparently requires explicit WhatsApp user opt-in to be available. Meta is of course going to maliciously comply as best they can, so they've made sure interoperability is off by default and requires a specific opt in.
  - nextaccountic10 days ago
    > What one should do about this? I mean, beside working on lowering that number.
    Every business in Brazil has an whatsapp to talk to their clients. Sometimes this whatsapp goes into the phone or computer of a real human being. Other times, it's manned by a bot (usually a dumb choose-your-own-adventure bot - I don't see business using LLMs for this here)
    Indeed I use food delivery apps (ifood here) only to check out the menu of delivery restaurants, then I search for them in Google so I can order directly from them through whatsapp. This won't work for some dark kitchens, but other than that it's pretty reliable and avoid the middleman
  - tremon10 days ago
    Advocate protocols over platforms. Have your government take an active interest in opening up closed communication systems and mandating third-party client access.
  - morpheuskafka10 days ago
    Well, you now have the right to use third-party apps to exchange messages with WhatsApp users, but apparently your law only covers it if the other user is in the EEA. So you are back to square one when communicating with India, Pakistan, and much of SE Asia, Africa, and MENA.
  - galangalalgol10 days ago
    Can you describe your reasons? I haven't developed an opinion as no one here uses it.
    jraph10 days ago
    I refuse to use proprietary software as much as I can, especially when it has a strong network effect where it encourages others to join.
    Meta is also a despicable company, they don't need my help to succeed.
    (edit: and I haven't abandoned the idea to switch back to a Linux mobile OS at some point, and WhatsApp would be a pain)
- zikani_0310 days ago
  Where I come from (Malawi, Africa), WhatsApp is so widespread that most people prefer it over email - to the extent that people don't really check their e-mails unless it's required for work or they are applying for something. For most people, WhatsApp is the de-facto communication channel.
  I help moderate a community of developers and we hit the whatsapp group limit of 1024 members and sometimes have to wait for someone to leave (intentionally or accidentally) before we can add new members. We've tried to move people onto "better" platforms like Discord or Slack but we always end up coming back to WhatsApp which is subsidized via MNOs (mobile network operators) social media data/internet bundles and for the fact that most people are just stuck on whatsapp.
- atoav10 days ago
  Yeah and we know it is over 3 billion because security researchers from the university of Vienna could read that in one go from one source ip address without encountering any rate limiting:
  "phone number, public keys, timestamps, and, if set to public, about text and profile picture. From these data points, the researchers were able to extract additional information, which allowed them to infer a user's operating system, account age, as well as the number of linked companion devices."
  See: https://www.univie.ac.at/en/news/press-room/press-releases/d...
- signal1110 days ago
  In markets where Whatsapp is entrenched, it’s already begun to enshittify.
  They have ads and spam already (sorry, no-consent messages from businesses). This isn’t even new. [0]
  There’s a clear pattern, say “we’ve rolled out strict policies”[1] and then… nothing changes on the ground, and TechCrunch writes another “they’ve fixed it” article a year later.[2]
  Also their Communities feature has pretty crap UX.
  Yes WhatsApp’s pervasive. But if pervasive was the end of the story, we’d all be using ICQ and AOL. The last thing any country needs is to hand over more of their lives to Facebook [sic].
  [0] https://techcrunch.com/2022/10/10/in-india-businesses-are-in...
  [1] https://techcrunch.com/2024/11/20/whatsapp-will-finally-let-...
  [2] https://techcrunch.com/2025/10/17/whatsapp-will-curb-the-num...
- moomoo1110 days ago
  Sure, but like with most things, maybe like 200 million max of them in NA/EU would actually bring in real money.
- axegon_10 days ago
  Honestly? That claim seems a bit(read A LOT) exaggerated. I haven't had whatsapp in a decade and none of my friends(scattered all over Europe) or family uses it. Viber used to be a big deal and to an extent still is in some areas of Europe. Personally I think I've talked almost everyone into migrating to Signal.
  - conradludgate10 days ago
    No one I know in the UK seriously uses signal. If I'm asking for a phone number from a neighbour it's going to be WhatsApp
    jsiepkes10 days ago
    In the Netherlands Signal is getting traction. I talk to most people via Signal, about 85% of my messages are via Signal. Which includes my parents, and I didn't even put them on Signal.
    DANmode10 days ago
    Yep.
    Nontechnical uncle messaging me on Signal was a great signal that Signal was gaining some social traction.
    DANmode10 days ago
    What part of the UK?
- Capricorn248110 days ago
  Doesn't this description describe Facebook itself? Should we make apps more like that as well? Because they could not be more polar opposite each other.
storystarling11 days ago
The hardest part of a rewrite like this is usually maintaining bug-for-bug compatibility with the legacy parser rather than the actual Rust implementation. Most real-world media files are malformed in some way that the C++ code implicitly handled, so if you write a strict parser you end up breaking valid user data. Differential fuzzing seems like the only practical way to map that behavior without manually reviewing millions of edge cases.
- dwattttt11 days ago
  It sounds like it's a design goal of this "wamedia" to _not_ maintain bug compatibility with media players.
  - storystarling10 days ago
    I suspect it is actually about maintaining permissiveness for malformed inputs rather than keeping security bugs. I ran into this building ingestion for a print-on-demand service where users upload technically broken PDFs that legacy viewers handle fine. If the new parser is stricter than the old one you end up rejecting files that used to work, which is a non-starter for the product.
- rubymamis10 days ago
  AI reply?
  - storystarling10 days ago
    Not AI. Anyway, the real issue is permissiveness vs strict parsing—real-world files are messy.
nevi-me11 days ago
> We believe that this is the largest rollout globally of any library written in Rust.
I suppose this is true because there's more phones using WhatsApp than there are say Windows 11 PCs.
Given that WhatsApp uses libsignal, is it safe to assume that they haven't been using the Rust library directly?
- marisen11 days ago
  WhatsApp doesn't use libsignal, and Android is already pretty Rusty and deployed more than WhatsApp around the world (not just smartphone. Tons of "embedded" use cases also run on custom Android)
  - charcircuit10 days ago
    >deployed more than WhatsApp
    If you count old Android versions before Rust was added.
  - fabrice_d10 days ago
    WhatsApp was using libsignal (the C version) when I worked on the KaiOS integration in 2017/2018.
  - 11 days ago
    undefined
  - pjmlp11 days ago
    Like our gym devices that have a full tablet to run a basic application to control weights, talk about wasting money.
    g947o11 days ago
    It doesn't make sense for that device alone, but the vendor probably supplies all the different equipment in the gym. Using a tablet simplifies their supply chain, deployment, debugging/repair, app update process and simply supports more features. There are probably some connectivity features on the device, for example. When you look at all of that together, it's hard to argue it's wasting money.
    It's like complaining about Electron apps. For sure I love small native apps like everyone else. But, if Electron enables a company to ship cross-platform apps and iterate faster, who am I to say no?
    (I happen to have seen some of those tablets in diagnostic mode and poked around a bit. These things are much more complicated than you think.)
    rswail10 days ago
    Once you price in the cost of integration, plastics, ROHS, CE and other regulatory/certifications, the extra cost of an Android tablet which already has a lot of that starts to make sense.
    If you also add in the extra ease of things like device management across fleets etc, it becomes a no-brainer for the manufacturer.
    jerf10 days ago
    The major problem with sticking an Android tablet on to exercise equipment is the difference in life spans. Android tablets are generally going to last you 4-5 years. Weight equipment should be able to last decades. There is some simple & cheap hardware that can last decades, but it is legitimately harder to program.
    Even worse was an article some months back about Android tablets hooked to heating & cooling systems expected to last 20 years. There's no way those things are making it at scale.
    g947o10 days ago
    > Weight equipment should be able to last decades.
    "should" or "actually can"? Do you have references to show that's the actual lifespan of the equipment, mechanically?
    jerf10 days ago
    Weight training equipment lasts decades all the time. It's just big piles of metal, it's not hard to get right.
    What actually prompted the engineering-CYA "should" is if the Android tablet is controlling some sort of robotic system for selecting weight sizes, that that system might have an expected life span on par with a tablet, being a physical thing moving around some pins or something in a potentially hostile user environment. That'll break long before anything else would.
    10 days ago
    undefined
    g947o10 days ago
    So you don't have a reference.
    I'm just going to ignore this.
    jerf9 days ago
    If you are the sort of person who needs a reference for "weight equipment lasts a long time", feel free. Whatever guilt and shame you think I should be feeling over such a claim, believe me, I don't. I'm more in the "feeling pity for you" department here; I've been around enough to know what kind of person types messages like this.
    pjmlp10 days ago
    Well, doesn't look like to me, and a plain ESP32 with a touch screen would do the job for displaying a weight bar with plus, minus and reset count buttons.
    usrusr10 days ago
    And then you get to a cardio unit where you want a completely different set of features and have to start over. Going lean on hardware only makes sense when you push out a very high number of units, when you have to deal with battery constraints or when you just have a lot of intertia, the combination of existing codebase and developer filter skillset.
    pjmlp10 days ago
    Except all the machines have the same feature set I mentioned.
    Agree that wanting to hire cheap developers is why they did it that way, the current interface is so laggy that I would bet it is Web based, on top of running Android for nothing.
    rswail10 days ago
    That's not a problem of the platform, but is a problem of the developers.
    The extra cost of an Android capable tablet (maybe $200 especially wholesale) is a minimal hardware cost considering the overall price of the equipment is in the thousands.
    But finding good embedded developers is a very difficult problem to solve, much easier to find Android app developers and then you get the Android eco-system for free like device management, OTA updates etc.
    Put all the sensors and controls on a USB bus and you need one or two actual embedded developers to deal with the drivers and the rest of the developers can build the UI that people see.
    In the case of a gym, the person buying the equipment is the customer, not you.
    They want features that will make you "sticky" to the gym, plus save costs on training you on how to use the equipment.
    usrusr10 days ago
    Cardio units have neither a "weight bar" nor a repetition counter, but they have a whole universe of possible features in the realm of scripted sequences, reactions to HRM signals and even just "making time pass" features. With unbounded gimmickyness, the sky is the limit.
    Personally, I'm a bit of an aficionado of close to the metal sports electronics. When I stare at gym screens I immediately notice updates that are supposed to come in once a second to get randomly delayed by what must be hundreds of millis. But I can totally see why they went that route. It's a market where feature quantity is big as a success metric and using a maintenance-friendly platform is even bigger. Wether Android actually checks that box might be debatable, but a bad embedded implementation could easily be worse, no doubt about that.
    In the old days, those screens would have randomly dropped into some Windows desktop failing to operate in some kiosk mode fantasy.
    miki12321110 days ago
    And then you start selling in a country which demands accessibility for your equipment. Good luck getting a 20+ language human-sounding TTS system on your ESP32.
- pjmlp11 days ago
  If you watch "Microsoft is Getting Rusty: A Review of Successes and Challenges" it appears the whole effort is more on the Azure side, and besides some timid adoption like GDI regions, there is a lukewarm adoption of Rust on Windows side, still pretty much a C and C++ feud.
  https://www.youtube.com/watch?v=1VgptLwP588
palata10 days ago
> Two major hurdles were the initial binary size increase due to bringing in the Rust standard library [...].
They don't say what they did about it, do they? Did they just accept it?
- sluongng10 days ago
  I suspect they just use no_std whenever its applicable
  https://github.com/facebook/buck2/commit/4a1ccdd36e0de0b69ee...
  https://github.com/facebook/buck2/commit/bee72b29bc9b67b59ba...
  Turn out if you have strong control over the compiler and linker instrumentations, there are a lot of ways to optimize binary size
- dcsommer10 days ago
  We invested a lot into build system optimizations to bring this number down over time, although we did accept on the order of 200 KiB size overhead initially for the stdlib. We initially launched using a Gradle + CMake + Cargo with static linking of the stdlib and some basic linker optimizations. Transitioning WhatsApp Android to Buck2 has helped tremendously to bring the size down, for instance by improving LTO and getting the latest clang toolchain optimizations. Buck2 also hugely improved build times.
  - palata10 days ago
    Thanks!
- pornel10 days ago
  Probably yes. It's ~300KB per binary, and it's a one-time cost.
  It can be avoided entirely by disabling the standard library, but that's inconvenient, and usually done only when writing for embedded devices.
  Usually the problem isn't the size directly, but duplication of Rust dependencies in mixed C++/Rust codebases.
  If you end up with a sandwich of build systems (when you have library dependencies like C++ => Rust => C++ => Rust), each Rust/Cargo build bundles its copy of libstd and crates. Then you need to either ensure that the linker can clean that up, or use something like Bazel instead of Cargo to make it see both Rust and C++ deps as part of a single dependency tree.
  - surajrmal10 days ago
    The size is not fixed. It changes based on how much of the standard library you use. Dynamically linking the standard library is also a valid option in many cases.
    galangalalgol10 days ago
    Posted elsewhere but The default hello world stripped with one codegen unit and panic=abort was 342kB both nightly and stable. Adding lto dropped it 42kB in stable and 40kB in nightly. Adding build-std and only building core did not reduce it any further in size.
    surajrmal10 days ago
    I agree, but if you use more of the std library it will contribute more to the final image. I can write a 100 line rust file that ends up being 1MiB (even after lto) because I maximize as much code from the standard library as possible. This is not a knock on rust, but your statements can be a misleading as well. In practice most folks ignore the majority of the standard library so only a few hundred kib of std library end up in their binary.
    galangalalgol9 days ago
    Bit late, but I made a small program that did network and file io as well as using a variety of containers and running system commands. I couldn't get the default release over 650kB. Using a single codegen unit lto strip and panic=abort got that down to 432kB. Using build-std didn't get it any smaller still. When I added optimization for size was the only way I got build-std to shrink things any further than the other options alone, and that only got me 10kB. My conclusion is that build-std is not a substantial contributor. Using std seems to add 300kB-500kB depending on how much you use. That seems like a lot to me because I am old, but elf binaries add several kB of header so maybe I should stop worrying so much.
    surajrmal9 days ago
    If you build the standard library as a shared library it will be 4+MiB. The portion of that which you end up using is variable but there are ways to accomplish large usage without a great deal if code. I can get a 1.5 MiB binary down to 500KiB by dynamically linking the shared library. It's a net fun because I have many such binaries so it saves size in aggregate. It really does come down to what subset you use though.
    galangalalgol10 days ago
    Can it do lto on stdlib even without the nightly build-std flag?
    pornel9 days ago
    I mean you get one upfront cost for things like allocators, common string manipulation and std::fmt, std::{fs, io, path} helper functions, and gathering of pretty backtraces for panics (which is a surprisingly fiddly task, including ELF+DWARF parsers and gzip to decompress the debug info).
    A println!("hello world") happens to pull in almost all of it (it panics if stdout is closed).
    Later code growth is just obviously proportional to what you're doing, and you're not getting a whole new copy of std::fmt every time you call print.
- jsheard10 days ago
  Who knows what they did, but there are things which can be done: https://github.com/johnthagen/min-sized-rust
- menaerus10 days ago
  The whole article a bit watery which is why I read it as a PR rather than technical presentation
kpcyrd11 days ago
Very cool! I'm wondering if Signal is doing something similar? libsignal is implemented in Rust, but I don't know about the other parts.
I_am_tiberius10 days ago
> "WhatsApp provides default end-to-end encryption for over 3 billion people".
Wasn't there news lately that they can still read your messages somehow?
- wongarsu10 days ago
  WhatsApp could exfiltrate messages at the ends. But I assume the trick lies in the word "default". Didn't Skype also default to end-to-end encryption, unless there was a server flag that disabled it for that specific user (I might be fuzzy on the details)
  - londons_explore10 days ago
    I don't trust un-auditable client applications...
    If you want to assure me your e2e is secure, there must be at least two clients implemented by different people, with at least one of them opensource.
    Whatsapp used to have this, but lately they have cracked down on third party clients.
    mschuster9110 days ago
    > Whatsapp used to have this, but lately they have cracked down on third party clients.
    Blame spammers on that. The amount of scammers and spammers on Whatsapp is unreal.
    rvnx10 days ago
    Even if they have, this doesn't prevent from turning on a feature flag, or push an experimental build to some users.
    londons_explore10 days ago
    If there is a 2nd opensource client written by someone else, you would hope they would raise the alarm when asked to implement "feature flag 437 means send all the crypto keys to the server".
- 4gotunameagain10 days ago
  Every encryption is end to end if you're not picky about the ends, or metadata.
  Do you trust facebook (excuse me, meta) to not snoop on your messages, and to not share them with the "intelligence" agencies ?
  - Fripplebubby10 days ago
    This is not true. The IETF draft is explicit that E2EE means that the message cannot be read by any party other than the sender and the intended receiver. When companies like Meta claim they support E2EE, this is what they claim. There are no tricky semantics or legalese at play here.
    monocasa10 days ago
    To be fair zoom did claim E2EE, with one of the ends being their servers.
    morpheuskafka10 days ago
    Speaking of Zoom and encryption, its crazy that they bought Keybase (I think they basically said it was largely an acquihire) years ago, and have neither shut it down as everyone thought, nor materially changed it in any way. Unless they changed something it even gives 200GB cloud storage (KBFS) iirc.
    antonvs10 days ago
    It's not entirely accurate to say "any party other than the sender and the intended receiver," since the messaging app running on the user's device can read the messages. Something like "any third party (other than the app vendor)" would be more accurate. Without actually analyze app behavior, it comes down to trusting that the vendor doesn't do anything nefarious.
    londons_explore10 days ago
    One could imagine a design where even the app vendor is untrusted... You would send an encrypted chunk direct to the GPU, which would then decrypt and render the message text in some secure environment onto the screen.
    Neither the OS nor the application would know the contents of your message beyond "it's 500x700 pixels".
    Similar things are done for DRM video, and widevine level 1 or 2 haven't seen many breaches despite running on a wide array of hardware open to physical attack.
    antonvs10 days ago
    Oh it's definitely possible. The (dis)incentives tend to be strongly against such secure systems, though.
    londons_explore10 days ago
    In the messaging game, there is every incentive to be seen as the secure-est one.
    If you can have an e2e chat between two iphones locked in a big glass box with a sign that says "Anyone who can hack into this conversation gets $100M", that's a really good marketing campaign.
    If you can make the app use secure enclaves or whatever to take the ~100k people who write the source code of the libraries, app and OS out of the attack surface, that $100M becomes much safer.
    Fripplebubby10 days ago
    I think the draft covers this well: https://www.ietf.org/archive/id/draft-knodel-e2ee-definition...
    antonvs10 days ago
    Technical drafts will tend to get this right, where the communication often breaks down is how it's communicated to users.
    rvnx10 days ago
    As far as I remember, Google does the final signing of the APK, which is eventually the signature verified by the OS to verify if an update is valid or not.
    So Google can, if ordered or willing to help, create a new release track (e.g. experimental-do-not-deleted) and add specific e-mails to that track with the "improved" version.
    Nobody would be able to see that in real world, and you know what, if WhatsApp themselves are ordered, they can also create their own "test" track, it's just less covert but it would technically be working.
    In all cases, Google and Apple have to respect US laws, and the laws of earning money too.
    If you do not cooperative with intelligence / police services of your country, only bad things can happen.
    mr_mitm10 days ago
    Yes, the app could be compromised, or the OS, or the compiler of the app, or of the OS, or the OS of the compiler, or the CPU any of these things run on, etc. etc. None of that is relevant to the definition of E2EE.
    antonvs10 days ago
    It's relevant to how E2EE is described to users. Representing that it's not possible for anyone other than the sender or recipient to read messages is misleading and just incorrect in general.
    A particularly relevant point is when it comes to government interception. E.g. it would be perfectly possible for an messaging app to have a "wiretap mode" that the vendor enables for users that are the subject of a relevant warrant.
    rvnx10 days ago
    > When companies like Meta claim they support E2EE, this is what they claim.
    Well, that statement can only resolve to true.
    These requests of data collection are perfectly legal. FBI DITU gives an order: give me all chats from *@banana.com and they receive banana.com.
    From there, two choices from the perspective of a tech provider:
    a) You accept. You get paid.
    You can always claim you had been coerced / are a victim, and that everything has been done by the law.
    b) You refuse. It's a crime.
    You take the risk to lose over 250K per day (!) in fines, some other court scandals that will come to you, some shady private stuff (what if we learn about your secret jacuzzi ?), harassement of the team, be publicly shamed that you supported terrorists who caused actual death of Americans, etc. In addition, nobody will know that you are the privacy hero and you are not even sure that the data is not exfiltrated another way.
    To this day, Apple, Facebook, Google still deny participating in illegal requests. They claim these were lawful requests, that have been carefully looked one-by-one.
    Yes, we looked carefully and decided we won't enjoy losing 100M USD and go to jail.
    The trick is that the identifier / wildcard can be very vague and wide. Or there can be multiple of them, each of them are narrow, but put one of top of the other they are super wide.
    jolmg10 days ago
    Do companies that claim E2EE support face consequences if they don't abide by IETF's definition? Not like IETF governs them.
  - miki12321110 days ago
    > Do you trust facebook (excuse me, meta) to not snoop on your messages
    No, but I trust some nosy German guy at TU Whatever to spend hours poking at the assembly, find that hidden flag and proudly present it at 40C3.
    With enough eyeballs, all source is open (and AI will give us far more eyeballs than we have any idea what to do with).
    Sure, you can have different builds distributed to different people, but the NSA can also just do that with Signal, Signal being open source makes it that much easier. FDroid mitigates this somewhat, but it's not like the NSA can't get a fake TLS certificate for their domain and MITM your communications.
aloukissas10 days ago
I love how Meta will do anything but prevent phishing and prepaid credit card scams in Whatsapp/Messenger
blub10 days ago
Just like Google’s Rust-in-Android blogs this reads like a PR piece (and in the case of facebook also recruitment piece) with some technical words sprinkled in for effect. The overall communication quality is that of a random startup’s “look what we did” posts.
The interesting aspects, such as how they protect against supply-chain attacks from the dependency-happy rust toolchain or how they integrated the C++ code with the Rust code on so many platforms - a top challenge as they said - remain a mystery.
Would also be interesting to hear how much AI-driven development they used for this project. My hope’s that AI gets really good at Rust so one doesn’t have to directly interact with the unergonomic syntax.
- surajrmal10 days ago
  The point of articles like this is to help build credibility for rust adoption. Rust is still not very widely adopted industry wide, and a lot of smaller players only use established technologies that bigger firms have shown works well. Rust is not inevitable, and articles like this are necessary for its future industry adoption.
  - blub10 days ago
    I had already said it’s a PR piece, you’re merely rephrasing that and making it sound like a good thing.
    This and the Google blogs offer zero technical insights and I haven’t learned anything from any of them.
    surajrmal10 days ago
    PR makes it sound like it only benefits the company. It benefits the broader rust community as well. Where was it established that the article must provide you some technical knowledge to learn? I sure didn't go into reading it with the expectation it would.
- antonvs10 days ago
  > The interesting aspects, such as how they protect against supply-chain attacks
  There are standard techniques to help manage this that apply across languages, there's no reason to reinvent that wheel.
  > My hope’s that AI gets really good at Rust so one doesn’t have to directly interact with the unergonomic syntax.
  "Unergonomic syntax" is the battle cry of many people resisting learning a new language. AIs have progressed far enough that they can help you in that learning process, though.
  - blub10 days ago
    The dependency management and complexity/poor ergonomics are the two major technical problems with Rust. Normally the first one’s ignored while the second is downplayed, so it would have been interesting to see what (if anything) Facebook have done about them.
    Not only can AIs help, but they can write most if not all the code and spare the human from learning all the intricacies of individual programming languages. Problem is, reports are contradictory on compatibility with Rust. We know they work great with simpler/friendlier languages like Go or Python.
aero-glide210 days ago
Quite impressive, I did not know so many bugs were due to memory access.
- IshKebab10 days ago
  To be fair the increased reliability of Rust code over C++ isn't just because of memory errors (out-of-bounds accesses, use-after-free, type confusion, etc). You also get:
  * No undefined behaviour (outside `unsafe`, which is quite easy to avoid). In C++ there are many many sources of UB that aren't really memory errors directly, e.g. signed integer overflow or forgetting to `return` from a function.
  * A much stronger type system.
  Those two things have a really significant impact on reliability.
  - tialaramex10 days ago
    Rust's "A language empowering everyone..." tagline also helps justify the heavy lifting needed to prevent you shooting yourself in the foot, because we're all able to imagine a hypothetical less experienced programmer who might make a mistake even as we swear that we'd never make it ourselves.
mentalgear10 days ago
Cool - now we only need to get selling-you-out-for-profit-Zuckerberg out of WhatsApp to make it really trustworthy.
yasmineroy334 days ago
[dead]
wrtc_dev10 days ago
[flagged]
- randomint6410 days ago
  That's right, Signal (https://kerkour.com/signal-app-rust), Proton (https://kerkour.com/proton-apps-rust), Matrix, Wire and many more are using a share, cross-platform Rust core and a platform-dependent UI layer.
  But it's not only the security-critical paths, but also most of the business logic (see the 2 posts above).
- wongarsu10 days ago
  I agree with everything you say. But wow, does that comment sound like AI. Probably Grok?
  Not saying you are AI, you might just be a heavy user who picked up the same patterns
  - jsheard10 days ago
    If it were an old account I might have given them the benefit of the doubt, but they literally just joined to make this comment. There's so many green accounts popping up which reek of AI now, like I've seen ones where all of their comments are almost exactly the same length.
  - rob10 days ago
    It's a brand new account that reads 100% like a ChatGPT response where the author just swapped out the em dashes for hyphens when posting, knowing it's a common "indicator" people look for.
    It's more surprising to me that it seems to have already fooled a bunch of people looking at their replies to you.
  - 10 days ago
    undefined
  - m00dy10 days ago
    I like your AI slop detector, is it part of your consciousness ?
  - candiddevmike10 days ago
    The "is key - ", is a key giveaway.
    EDIT to expand the evidence: It's placing unnecessary emphasis on a one off mention in the article (differential fuzzing) and then writes a bunch of bullshit around what it thinks it means (it's wrong, differential fuzzing isn't running them both in parallel during a transition, it's a testing methodology based on inputs/outputs).
    braiamp10 days ago
    Which many people use. Heck, go to Stack Overflow about 10 years back. You will see people using it. It's a style.
    seritools10 days ago
    TIL I'm an AI
    jdxcode10 days ago
    I think it's a giveaway that it's human! A hyphen is incorrect punctuation.
    wongarsu10 days ago
    According to British style guides an en-dash would be correct in that usage, and the difference between an en-dash (–) and a hyphen (-) is pretty small. Seems perfectly defensible to me unless you are publishing a book or academic journal
    dewey10 days ago
    AI is trained on human output, so that's not really a good differentiator.
happyweasel10 days ago
Let's see how this unwrap()s in production scnr
- galangalalgol10 days ago
  Oh come on, that was funny. It also highlights a problem with the way people write rust. If your app panics it has a bug. People throw panics in cases that can absolutely happen, a file isn't there or fails to parse, some set of inputs is mutually inconsistent these are things for error checking. Even if the correct way to handle an error you detect is to stop the app, do that instead of panicking. Panics are for things that should be impossible. Ideally they even get optimized out.
justinlords10 days ago
The differential fuzzing approach is clever — way safer than a big-bang rewrite. Running both versions in parallel to catch edge cases before switching over is how you actually ship rewrites without breaking production. The 160k to 90k LOC drop is impressive, but the real engineering win is the validation strategy.
On binary size, static linking with LTO should handle most of the bloat without needing custom stdlib builds.
- chinathrow10 days ago
  We really need an AI filter here on HN.
  - stingraycharles10 days ago
    A comment like this works as well, let the community do its thing.
    rvnx10 days ago
    There are a couple of bots here.
    Quoting a user:
    keeping it simple: a flat $15,000 to get you on the front page of Hacker News. [...] contact e-mail below
    Expensive, but now with LLMs it's super cheap to do.
    Spend a week to do a bot, get 10'000 USD of ARR for your B2B tech SaaS, and applause from your investors.
    And a week is probably exaggerated, 2 days max
    stingraycharles10 days ago
    Do you have any actual evidence that these types of services are being offered for that type of price point, though?
    The reason I'm asking is that I actually believe the price point is much lower. It's probably much easier to get on the front page of HN of you time the submission + upvotes well enough.