• 0 Posts
  • 3 Comments
Joined 1 year ago
cake
Cake day: July 4th, 2023

help-circle
  • My guess is that it’s more a result of overfitting for alignment. Fine-tuning for “safety” (rather, more corporate-friendly outputs).

    That is, by focusing on that specific outcome in training the model, they’ve compromised its ability to give well-“reasoned” “intelligent” sounding answers. A tradeoff between aspects of the model.

    It’s something that can happen even in simple statistical models. Say you have a scatter plot of data that loosely follows some trend, and you come up with two equations to describe that trend. One is a simple equation that loosely follows it but makes a good general approximation, and the other is a more complicated equation that very tightly fits the existing data. Then you use those two models to predict future data. But you find that the complicated equation is making predictions way off the mark that no longer fit the trend, and the simple one still has a wide error (how far its prediction is from the actual data) but still more or less accurately fits the general trend. In the more complicated equation, you’ve traded predictive power for explanatory power. It describes the data you originally had but it’s not useful for forecasting data that follows.

    That’s an example of overfitting. It can happen in super-advanced statistical models like GPT, too. Training the “equation” (or as it’s been called, spicy autocorrect) to predict outcomes that favor “safety” but losing the model’s power to predict accurate “well-reasoned” outcomes.

    If that makes any sense.

    I’m not a ML researcher or statistician (I just went through a phase in college), so if this is inaccurate I’m open to corrections.


  • I think you’re right; it will probably never have particularly wide reach, but it will (and to some extent does have) deep appeal.

    What I mean is that people who are attracted to a platform like Lemmy are the kind of people who are likelier to have those niche passions and knowledge on those topics. And they are the kind of people who are also likelier to participate in communities around those things. No, not everyone, and yes there are still communities with a broader appeal and less depth, but I think my point is clear enough. It’s just kind of intrinsic to how the platform works and how it is positioned in the broader internet space.