Why Pandas feels clunky when coming from R (2024)

https://www.sumsar.net/blog/pandas-feels-clunky-when-coming-from-r/
62•Tomte•5h ago

Comments

great_wubwub•4h ago
I have no R experience but have been using Polars instead of Pandas for this sort of stuff and it feels less clunky. How does Polars compare to R?
j_bum•4h ago
I strongly prefer `dplyr` and the R stack for table processing and visualization.

But, recently I’ve been working with much larger scale data than R can handle (thanks to R’s base int32 limitation) and have been needing to use Python instead.

Polars feels much more intuitive and similar to `dplyr` to me for table processing than Pandas does.

I often ask my LLM of choice to “translate this dplyr call to Polars” as I’ve been learning the Polars syntax.

aydyn•3h ago
It blows my mind that in 2025 R is still limited to 2^31-1 rows. R needs a Python 3.0 moment, but that is unfortunately not going to happen for certain unfortunate but unnecessary reasons.
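For scale, the 2^31-1 figure is the largest value a signed 32-bit integer can hold. A minimal sketch of that arithmetic in Python (the helper `fits_in_int32` is a hypothetical name used only for illustration, not part of R or any library):

```python
# Largest value a signed 32-bit integer can hold.
INT32_MAX = 2**31 - 1


def fits_in_int32(n_rows: int) -> bool:
    """Return True if a row count can be indexed with a signed 32-bit integer."""
    return 0 <= n_rows <= INT32_MAX


print(INT32_MAX)                      # 2147483647
print(fits_in_int32(2_000_000_000))   # True
print(fits_in_int32(3_000_000_000))   # False
```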
j_bum•3h ago
Yep. I have a deep love/hate relationship with R.

This is one of those decisions that I just do not understand. In your mind, why do you imagine a set of improvements won’t be made?

Otherwise, for now, working with Python and R using the reticulate package in Quarto is perfect for my needs.

If the Positron IDE could get in-line plot visualization in Quarto documents like the RStudio IDE has, I’d be the happiest camper.

BDPW•4h ago
I've had a similar experience from the opposite side. I've had quite a few years of experience in Python and had to work in R for an internship during my masters.

My impression was that it's pretty easy to do straightforward things like the examples described in the article. But when you have to do complicated or unusual things with your data I found it very frustrating to work with. Access to the underlying data was often opaque and it was difficult for me at times to figure out what was happening under the hood.

Does anyone here know any research areas still using R?

Tomte•4h ago
Everyone in statistics, and lots of people applying statistics in other disciplines (anthropology etc.).
j_bum•4h ago
In addition to stats, R is widely used in computational biology and bioinformatics domains. It’s also widely used in the biopharma industry for a variety of other purposes.
mauritsd•4h ago
IME (bioinformatics PhD in the Netherlands a number of years ago) it's mostly still preferred in a (pre-)clinical context, not so much in academia itself
kgwgk•4h ago
> My impression was that it's pretty easy to do straightforward things like the examples described in the article. But when you have to do complicated or unusual things with your data I found it very frustrating to work with.

That's where I realised that the "modern" approach was taken in the article - which obviously I had not looked at.

pteetor•3h ago
R is used extensively in quant finance. The quant traders, portfolio managers, and risk managers with whom I work all use R.
vharuck•3h ago
As an R user, I get what you mean. If you need to do things that don't fit well in the "tidyverse" model, you have three options:

1. Wrap the complicated bits in functions, then force it into the tidyverse model by abusing summarize and mutate.

2. Use data.table. It's very adaptable and handles arbitrary multiline expressions (returning a data.table if the last expression returns a list, otherwise returning the object as-is).

3. Use base R. It's not as bad as people make it out to be. You'll need to learn it anyway if you want to do anything beyond the basics.

tyfon•3h ago
Not really research per se, but it's used extensively in banking here in Norway for anything from statistical model development to basic analysis and reporting.
dkdcio•4h ago
pandas* per the style guide (nobody follows it)

also I recommend trying Ibis. created by the creator of pandas originally and solves so many of the issues

https://ibis-project.org

jna_sh•4h ago
Any thoughts on ibis vs polars?
gnulinux•3h ago
Disclaimer: Never used Ibis before but I daily use polars and DuckDB.

It seems like Ibis uses DuckDB on its backend (by default) and has Polars support as well. Given this, maybe see if Ibis works better for you than polars. If you very specifically need polars, using that will for sure be better. DuckDB is faster than polars and it has great polars support, so depending on how Ibis is implemented it might be "better" than polars as a data frame lib.

Vaslo•1h ago
There is also Narwhals.

https://pypi.org/project/narwhals/#description

I tried really hard to use Ibis but I ran into issues where it was way easier to do some stuff in pandas/polars and had to keep coming out of Ibis to make it work so I gave up on it for the time being.

dleather•3h ago
I couldn't agree more. I'm fluent in languages like Julia and MATLAB. I'm 90% fluent in R and prefer data.table over dplyr, but working in both is easy enough. The past few months I've been fully transitioning to Python. And while base Python I find to be extremely elegant, typical data science and scientific computing workflows are a headache. There aren't just 1-2 packages to choose from for each use, every package has its own syntax, and keeping track of Pandas Series vs DataFrames is confusing. Want fast differentiable code? Then rewrite all your numpy code in JAX, which requires its own tricks.

What Python desperately needs is a coordinated effort for a core data science/scientific computing stack with a unified framework.

In my opinion, if it weren't for Python's extensive use in industry and package ecosystem, Julia would be the language of choice for nearly all data science and scientific computing uses.
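On the Series vs DataFrame point, a small sketch of how the return type in pandas depends on how you index. The frame and column names here are made up for illustration:

```python
import pandas as pd

# Toy frame; the column names are arbitrary.
df = pd.DataFrame({"x": [1, 2, 3], "y": [4.0, 5.0, 6.0]})

col = df["x"]           # single label      -> Series
sub = df[["x"]]         # list of labels    -> DataFrame with one column
row = df.iloc[0]        # single position   -> Series (values upcast to a common dtype)
one_row = df.iloc[[0]]  # list of positions -> DataFrame with one row

for name, obj in [("col", col), ("sub", sub), ("row", row), ("one_row", one_row)]:
    print(name, type(obj).__name__)
```

The same data comes back as two different types with different methods, which is the bookkeeping the comment above is complaining about.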

hatmatrix•1h ago
> And while base Python I find to be extremely elegant, typical data science and scientific computing workflows are a headache.

That's my impression as well. Going back to the topic of the original post, pandas only partially implements the idioms of the tidyverse so you have to mix in a lot of different forms of syntax (with lambdas to boot) to get things done. Julia is much nicer, but I find myself using PythonCall more often than I'd like.

Scipy was originally supposed to provide the scientific computing stack, but then many offshoots in the direction of pandas / ibis / JAX, etc. happened. I guess that's what you get with a community-based language. MATLAB has its warts but MathWorks does manage to present a coherent stack on that end.
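To illustrate the "different forms of syntax" point, here is a rough sketch of a dplyr-style filter / mutate / group_by / summarise pipeline written as a pandas method chain. The data and column names are invented for the example, and this is only one of several ways to write it:

```python
import pandas as pd

# Invented example data.
df = pd.DataFrame({
    "species": ["a", "a", "b", "b"],
    "mass": [1.0, 3.0, 10.0, 30.0],
})

result = (
    df
    .query("mass > 1")                            # filter: expression passed as a string
    .assign(mass_kg=lambda d: d["mass"] / 1000)   # mutate: lambda to reach the intermediate frame
    .groupby("species", as_index=False)           # group_by: keyword to keep species as a column
    .agg(mean_mass_kg=("mass_kg", "mean"))        # summarise: named aggregation via tuples
)

print(result)
```

Four steps, four different ways of referring to a column: a string expression, a lambda, a plain string label, and a tuple.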

wodenokoto•2h ago
A really, really big part of this is thanks to RStudio, which, as you write and run lines, will peek into memory to see what the columns in your dataframe are, and understands the dplyr DSL well enough to help you autocomplete what are essentially non-existent variables.
smabie•2h ago
The original sin of Pandas is row indices
hatmatrix•1h ago
Actually I like that you can use it as a dictionary of tuples (i.e., rows).
Vaslo•1h ago
One of the big benefits of polars over pandas is not dealing with the constant index nonsense. Can’t tell you all of the issues I had as a beginner with pandas trying to debug silly index errors.
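A small sketch of the kind of index surprise being described, assuming two Series whose row labels only partly overlap (the values are made up):

```python
import pandas as pd

# Same length, but the row labels only partly overlap.
a = pd.Series([1, 2, 3], index=[0, 1, 2])
b = pd.Series([10, 20, 30], index=[1, 2, 3])

# Arithmetic aligns on index labels, not on position,
# so labels present in only one Series come out as NaN.
total = a + b
print(total)

# Resetting the index first gives plain positional addition.
positional = a.reset_index(drop=True) + b.reset_index(drop=True)
print(positional.tolist())  # [11, 22, 33]
```

This usually shows up after filtering or sorting one of the two objects, when the labels no longer match the positions.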
emehex•1h ago
I haven't seriously used R in nearly a decade but I still miss (and think about) dplyr and the hadleyverse...

A few years ago I made a package called "redframes" that tried to "solve" all of my frustrations with pandas, make data wrangling feel more like R, while retaining all the best bits of Python...

Alas, it never really took off. For those curious: https://github.com/maxhumber/redframes

hatmatrix•1h ago
Hey this looks pretty tidy.

There is so much hype and luck to widespread adoption, you never know with these things.

__mharrison__•1h ago
Lots of readers of my book, Effective Pandas, say it helps Pandas feel more like what they are used to with R...

(I've never used R myself, but certainly have some very strong opinions about Pandas after having written 3 books about it.)