Document that there's a limit in the size of the data that can be sent #339

cmdelatorre · 2015-03-27T17:44:42Z

As long as I understand, after reading #58 and other issues, raven-js uses HTTP GET requests to send data to Sentry. Apparently (and unlucky for me) it is a design decision related to the difficulties of handling CORS.

In any case, the fact is that there's a strict limit on the size of the error reports, that depends on the max url size supported by the Sentry web server. As I'm using the getsentry service, there's nothing I can do about it.

So, careless of how much I'd love to be able to POST longer reports to Sentry, I'd strongly recommend that you document this issue properly. I consider it to be very important and should be somewhere in the official documentation of the API.

Finally, thanks a lot for this great product.

mattrobenolt · 2015-12-31T00:32:58Z

This has changed drastically in 2.0.0 since we now send data via POST.

But, there is still a limitation depending on your server. If you're using on-premise, it's impossible to determine, but if you're talking to our servers (app.getsentry.com), our limit as of now is 100KB for the body.

If the message is too large, it'll be rejected.

@benvinegar We should add this to docs now.

jakubzitny · 2016-09-21T10:23:26Z

Hey guys, have you considered adding a check for the Sentry message size limit into Raven?

What do you think would be the best way to check / truncate the extra data that's over the limit?

(cc @benvinegar @mattrobenolt)

mattrobenolt · 2016-09-21T21:50:34Z

sentry.io has a limit that is effectively arbitrary for ourselves. This limit doesn't apply and may different for anyone running Sentry on-premise. Our limit as of today is 100k, but it's possible that in the future we may increase this size.

Now, there's no way to communicate this back to the server until it fails to make the request and is rejected. In theory, we could change the error that's kicked back to be JSON and be something like, {"error":413, "limit":100}, then trim and retry with that information it got. But this would be sorta specific to our implementation since this happens at our load balancer and not in the Sentry server itself.

it's also worth noting that we don't deal with this in other clients nearly as often since they all gzip their payloads before transmitting, so it's very rare to cross our 100k limit. But for JavaScript, we're not able to do this easily without bloating raven-js with a gzip library. (Ironic that we can't easily gzip from a browser, huh?)

jakubzitny · 2016-09-22T09:19:03Z

Thanks for the reply @mattrobenolt.

I understand the situation, but what if this was configurable? When configuring or instantiating the Raven client I'd add a parameter specifying the limit for one event and the client would automatically trim the contents somehow. Would it make sense?

(cc @vojtatranta)

benvinegar · 2016-09-22T17:21:11Z

There's also not a really easy way to say "limit this to 100kb". Users can pass extra values with arbitrarily deep objects and sub-objects, so we'd need an algorithm that basically traverses the entire tree and decides where to trim large items. And there's a ton of tricky cases to handle, e.g. how does it know that an array of 100 small items is less bytes than an array of 10 items with large items? This algorithm could become really complex, really fast.

We've experimented with payload size, and found that for a typical maxed out stack trace, and 100 breadcrumbs, the payload is roughly ~15 KB. That's not a lot of bytes for a ton of information. Users only really hit the 100kb limit if they add arbitrarily large arrays/objects to extra manually.

I think if anything, we should add a note to the extra config warning that there is an upper size limit, and users shouldn't indiscriminately add complex objects of unknown size.

jakubzitny · 2016-09-22T19:06:04Z

Yeah that's true.

Actually, the large objects in extra is exactly our case. We're attaching some logs (little bit similar to breadcrumbs), that can really be "misused" by a developer and filled with unneccessarily big objects. That's why we're contemplating about the limits there and what'd be the best way to trim it.

davishmcclurg · 2017-01-05T20:39:54Z

I just ran into this with an error that had a bunch of breadcrumbs. It would be nice if the client would re-send the error without breadcrumbs or extra data so that it doesn't disappear completely.

davidfurlong · 2017-10-18T18:22:36Z

With redux sentry middleware our entire redux state gets sent in extra. Now we're getting this. Any idea of a current estimate on max size? Looks like one request was around 240KB

kamilogorek · 2017-10-19T09:23:23Z

Any idea of a current estimate on max size?

It's currently 100kB. You can use dataCallback to strip something that's unnecessarily big and you know you can skip it.

davidfurlong · 2017-10-19T09:43:00Z

It's currently 100kB. You can use dataCallback to strip something that's unnecessarily big and you know you can skip it.

Yeah thanks I found the relevant docs. isn't dataCallback a config option, and not for individual raven _sends? Is the best practice to set the config and then unset it for an individual request? Additionally I feel that the best policy is to report the error to sentry without the heavy extra payload (or breadcrumbs or stack) rather than simply failing on the client... As many errors which can be caught in the use error reporting libraries should be reported to the error monitoring service IMO,

kamilogorek · 2017-10-19T11:32:22Z

It is indeed a global callback, which will be called for every event.

Additionally I feel that the best policy is to report the error to sentry without the heavy extra payload (or breadcrumbs or stack) rather than simply failing on the client...

It's not that easy to do. We cannot simply calculate the size of the request and strip something if it's too large. We have to be performant and low in size. Adding something like this would require serialization of a whole payload, calculating the size and recursively trying to get it down to "sendable" size, which can be just too heavy process if someone will send tenths of errors in a very short period of time.

davidfurlong · 2017-10-19T13:09:14Z

Yeah I see that that is an issue - I looked at doing this myself. But Im not suggesting trying to prune dynamic parts of the data - Im suggesting only sending the parts that are truly essential and which cannot collectively be too big. My suggestion would be: on a '413 Request entity too large' sentry retries the same request, but this time without sending `extra`, `breadcrumbs` and `stack` (and any other data which could be (realistically) unbounded in size).

…

On Thu, Oct 19, 2017 at 1:32 PM, Kamil Ogórek ***@***.***> wrote: It is indeed a global callback, which will be called for every event. Additionally I feel that the best policy is to report the error to sentry without the heavy extra payload (or breadcrumbs or stack) rather than simply failing on the client... It's not that easy to do. We cannot simply calculate the size of the request and strip something if it's too large. We have to be performant and low in size. Adding something like this would require serialization of a whole payload, calculating the size and recursively trying to get it down to "sendable" size, which can be just too heavy process if someone will send tenths of errors in a very short period of time. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#339 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAlhcP5xGZObiITHpqQV039kBtdbPaY4ks5stzNIgaJpZM4D2Amd> .

captbaritone · 2017-10-28T02:58:08Z

Clearly the best solution to the problem of excessively large payloads, is to trim them down, and we have nice APIs to do just this (Raven.setDataCallback). However, it can still be difficult to know exactly how to transform your data. This difficulty is compounded by the fact that you must err on the side of extreme caution since if you do go over, it fails silently. As users of Raven+Sentry this is the last thing we want! 😄

So, at the very least it would nice to know when you've sent an exception that was rejected by Sentry. I understand the infeasibility of doing this on the server, but as @davidfurlong points out, it could easily be done on the client.

I took a look at implementing this outside the library, but I don't think it's currently possible without depending upon some private implementation details of Raven.

Something like this does seem to work, but it relies on being able to access the default transport implementation via Raven._makeRequest (Not a good solution)

Hack:

function captureMessageWithoutContext(message) {
  Raven.setDataCallback((data, originalCallback) => {
    Raven.setDataCallback(originalCallback);
    data.breadcrumbs.values = [];
    return data;
  });
  Raven.captureMessage(message);
}

const originalTransport = Raven._makeRequest;
Raven.setTransport(opts => {
  const originalOnError = opts.onError;
  opts.onError = error => {
    originalOnError(error);
    if (error.request.status === 413) {
      captureMessageWithoutContext(
        "Failed to submit exception to Sentry. 413 request too large. " + error
      );
    }
  };
  originalTransport(opts);
});

I think if we included the actual XHR error, or even just the status code, in the ravenFailure event, it would be possible to implement this kind of recovery outside of Raven itself.

413 detection may very well be something that should live inside of Raven, but in the mean time, this would be a nice escape hatch and would let the community explore ways to handle 413 errors.

This topic is especially interesting to me, since raven-for-redux force users to manually trim down their data in order to avoid hitting the size limit, and I currently feel like I may be doing more hard then good, since I am leading them down a path which may very well be silencing real errors in their apps.

davidfurlong · 2017-10-28T11:18:38Z

413 detection may very well be something that should live inside of Raven, but in the mean time, this would be a nice escape hatch and would let the community explore ways to handle 413 errors.

This topic is especially interesting to me, since raven-for-redux force users to manually trim down their data in order to avoid hitting the size limit, and I currently feel like I may be doing more hard then good, since I am leading them down a path which may very well be silencing real errors in their apps.

I agree that 413 should live inside of Raven, but even just making it easier to implement your own 413 handling would be a big improvement.

Currently struggling to balance:

Sending as much relevant redux state
Not going over the size limits (both 100K and the imposed size limits per top level value in extra (I destructure my redux state for this)

The issue is that our redux state can vary tremendously in size & 413s can be occurring despite the trims. Redux state is a nice to have when debugging, but if it leads to a nontrivial probability the error wont be reported due to a 413, then its not worth it IMO. So no redux state in sentry :(. Sending just a bounded size subobject of the redux state is tricky and much less useful than having a more complete version. Additionally trimming dynamically makes it harder to debug because you don't know whether missing data is the source of the bug, or just a result of trimming..... I almost want encode and compress the whole state

kamilogorek · 2018-01-19T11:28:09Z

We can most likely utilize Node's serializer here as well. Just a note to myself.

Akash91 · 2018-07-16T15:16:25Z

Why is this not there in documentation ?

wjdp · 2018-07-26T15:33:48Z

Just come across this after several users sent in support requests about a frontend application. I'm rather surprised raven doesn't handle this given sentry's tagline "Stop hoping your users will report errors".

Also a hit to our adoption of sentry!

(We're getting big-ish breadcrumbns ~600k when catching Vue exceptions)

beaugunderson · 2018-08-06T04:18:57Z

with recent Chrome the limit is even smaller (65536 bytes) due to a bug: #1464

edit: not a bug! part of the fetch() spec, so { keepalive: true } should probably not be the default, or the docs updated to reflect that limit.

mattrobenolt added the Documentation label May 16, 2015

mattrobenolt added the Patch needed label Dec 31, 2015

captbaritone mentioned this issue Mar 18, 2017

configuration option to weather or not to include the whole action ngokevin/redux-raven-middleware#24

Closed

bjunc mentioned this issue Apr 18, 2017

RavenVue / Vue plugins: 413 Request Entity Too Large #937

Closed

kamilogorek removed Patch needed labels Sep 11, 2017

davidfurlong mentioned this issue Oct 18, 2017

When the redux state is really big, the request to sentry fails due to Request too large captbaritone/raven-for-redux#42

Closed

pgrm mentioned this issue Nov 16, 2017

fix: Typescript typings #1134

Merged

3 tasks

kamilogorek added the Type: Improvement label Jan 19, 2018

kamilogorek mentioned this issue Apr 24, 2018

How can we avoid additionnal data to be truncated? #1309

Closed

HazAT added the raven-js label Jun 12, 2018

kamilogorek mentioned this issue Aug 28, 2018

ref: Remove keepalive:true as a default and document payload size #1496

Merged

kamilogorek closed this as completed in #1496 Sep 4, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document that there's a limit in the size of the data that can be sent #339

Document that there's a limit in the size of the data that can be sent #339

cmdelatorre commented Mar 27, 2015

mattrobenolt commented Dec 31, 2015

jakubzitny commented Sep 21, 2016

mattrobenolt commented Sep 21, 2016

jakubzitny commented Sep 22, 2016

benvinegar commented Sep 22, 2016

jakubzitny commented Sep 22, 2016

davishmcclurg commented Jan 5, 2017

davidfurlong commented Oct 18, 2017 •

edited

Loading

kamilogorek commented Oct 19, 2017

davidfurlong commented Oct 19, 2017

kamilogorek commented Oct 19, 2017

davidfurlong commented Oct 19, 2017 via email

captbaritone commented Oct 28, 2017

davidfurlong commented Oct 28, 2017 •

edited

Loading

kamilogorek commented Jan 19, 2018

Akash91 commented Jul 16, 2018

wjdp commented Jul 26, 2018

beaugunderson commented Aug 6, 2018 •

edited

Loading

Document that there's a limit in the size of the data that can be sent #339

Document that there's a limit in the size of the data that can be sent #339

Comments

cmdelatorre commented Mar 27, 2015

mattrobenolt commented Dec 31, 2015

jakubzitny commented Sep 21, 2016

mattrobenolt commented Sep 21, 2016

jakubzitny commented Sep 22, 2016

benvinegar commented Sep 22, 2016

jakubzitny commented Sep 22, 2016

davishmcclurg commented Jan 5, 2017

davidfurlong commented Oct 18, 2017 • edited Loading

kamilogorek commented Oct 19, 2017

davidfurlong commented Oct 19, 2017

kamilogorek commented Oct 19, 2017

davidfurlong commented Oct 19, 2017 via email

captbaritone commented Oct 28, 2017

Hack:

davidfurlong commented Oct 28, 2017 • edited Loading

kamilogorek commented Jan 19, 2018

Akash91 commented Jul 16, 2018

wjdp commented Jul 26, 2018

beaugunderson commented Aug 6, 2018 • edited Loading

davidfurlong commented Oct 18, 2017 •

edited

Loading

davidfurlong commented Oct 28, 2017 •

edited

Loading

beaugunderson commented Aug 6, 2018 •

edited

Loading