Questions tagged [data-explorer]
For questions about the functionalities, queries, databases and datamodel of The Stack Exchange Data Explorer (SEDE). SEDE allows you to create and run queries against copies of the databases of popular Stack Exchange sites.
1,574
questions
3
votes
1
answer
54
views
How can I search for tags with that include a specified string in their Usage Guidance?
I would like to search for all tags (on Stack Overflow) that include some specific string in the Usage Guidance portion of their Tag Wiki. How can I do that?
Specifically I would like to search for ...
9
votes
1
answer
193
views
What is going on with SEDE (and SO)? Can it be used only after the weekly update ends?
It seems that at the moment:
After clicking on compose query you get: "Server Error in '/' Application. - Runtime Error". But it works if you use a different site instead of SO.
Running any ...
-5
votes
0
answers
136
views
How can I run SEDE queries on my computer? [duplicate]
Some SEDE queries time out on SEDE. How can I run SEDE queries on my computer?
-1
votes
1
answer
173
views
Increase max SEDE query running time
I see an increasing amount of:
Line 0: Execution Timeout Expired. The timeout period elapsed prior to completion of the operation or the server is not responding.
errors (screenshot) on SEDE due to ...
17
votes
1
answer
268
views
Can we exchange one index for another one?
In the Stack Exchange Data Explorer only the most logical/beneficial yet minimal set of indexes is created for the tables found in the schema.
One of those indexes is on the Users table:
CREATE ...
7
votes
2
answers
125
views
How should I go about retrieving the date of when a question became/was no longer hot with SEDE?
I am currently making a SEDE query to determine which questions from a specific user have made it onto the Hot Network Questions list and when. While I can get the specific questions, I am unable to ...
-1
votes
1
answer
120
views
Data explorer query takes a lot of time and needs to be optimized
This is a Stack Exchange Data Explorer query which gets the top users on specific skill scores (example: JavaScript) based on a specific country (example: Egypt), and it took a lot of time until it ...
7
votes
2
answers
249
views
Data Explorer Query crashes with collation conflict error
I have a website built off my Q & A posted in Stack Exchange sites such as; Ask Ubuntu, Stack Overflow, SE Security, etc.
Data Explorer is used to scrape ALL Stack Exchange Q & A I've posted ...
5
votes
1
answer
101
views
Are SEDE results ever cached on client side, or solely on server side?
I know that SEDE query results are cached on the server side. I wonder if in some cases they are also cached on the client side.
5
votes
1
answer
163
views
How can one see when a query was cached in SEDE?
Query results in SEDE are cached:
How can one see when a query result was last cached in SEDE?
25
votes
2
answers
702
views
Create a view that provides a list of databases
In an answer to Strange new database 'Data.StackExchangePOC' is messing up cross-site SEDE queries our DBA extraordinair Aaron Bertrand suggested to have a feature request for a view to list ...
14
votes
1
answer
271
views
Tags column of Posts table changed delimiter format. Many historical queries broken. Special tag rendering also broken
To the best of my memory, and in reading my past queries- some of which I re-run on a regular basis- the format of the Tags column of the Posts table in SEDE used to be like <starball><is>&...
14
votes
1
answer
555
views
What caused the drop in questions in April 2023?
Rerunning this query by starball for new questions and answers posted across the network generated the following graph:
It plots posts by their creation date, and includes deleted posts.
The decline ...
2
votes
0
answers
95
views
Logging out doesn't log out from data.stackexchange.com
Logging out says it logs out from stackexchange.com but that's incorrect as it doesn't log out from data.stackexchange.com.
Tested on Chrome with Android on a Galaxy S23.
10
votes
2
answers
171
views
How can I write and run a SEDE query without the query itself being visible to other regular SEDE users?
How can I write and run a query in SEDE without the query itself being visible to other regular SEDE users, such as in the "everything" tab of the "queries" page?
7
votes
1
answer
117
views
What exactly is the "sql" parameter in SEDE and what purpose does it serve?
SEDE supports query parameters. I found that if I try to define a query parameter of string type named "sql", whatever I pass as the argument for that parameter will be prepended by a string ...
4
votes
1
answer
99
views
How can I see the number of questions that a given user voted to close but got later reopened by someone else?
I wonder the following: for a given user, how can I see the number of questions that they voted to close, but the question was later reopened by someone else (possibly a group of users)?
7
votes
1
answer
144
views
What site should I post a question about an error given from a SEDE (Stack Exchange Data Explorer) query?
I am working on composing a SEDE (Stack Exchange Data Explorer) query to find questions from 3-4 months ago that do not have any accepted answers that I feel like I could answer (as in limiting it to ...
26
votes
3
answers
555
views
Strange new database 'Data.StackExchangePOC' is messing up cross-site SEDE queries
I just read a comment by user @questionto42 that one of my cross-site queries isn't working. My educated guess is this is because a special database Data.StackExchangePOC was created two days ago:
(...
12
votes
0
answers
138
views
The SE Data Explorer captcha is unnecessarily annoying
Whenever I try to run a query with the Data Explorer, my process is:
Type the parameters for the query.
Click "Run query".
Click the "I'm not a robot" checkbox that suddenly ...
16
votes
0
answers
289
views
What's going on with SEDE right now? It's being fickle
When I open a new tab, I get "we are currently offline", and then I refresh a couple of times and it's fine.
Then when I try to run a query, it will give
unknown job being polled!
or
...
6
votes
1
answer
145
views
Periodically publish a data dump of all IDs in the image ID space (and make it available in SEDE)
I see that Imgur has an API to query for image IDs in its ID space, but api.stack.imgur.com is not an API endpoint (and SE is moving off Imgur anyway). Say I want to get all the IDs of images in Stack ...
20
votes
2
answers
416
views
The Data Explorer (SEDE) Refresh choked on a Raspberry
As we all know the community eagerly awaits the weekly SEDE refresh with all our posted content so we can analyze, share-alike or train our large language models (LLMs).
The script that runs the ...
0
votes
0
answers
144
views
Get statistics about self-deleted accounts
It's interesting to know is there a possibility to get the number of deleted users on site per specified period?
Additional information like reputation, number of posts etc on the moment of deletion ...
16
votes
1
answer
270
views
Magic columns and site:// do not work for alcohol.meta.stackexchange
The site Beer, Wine & Spirits has the URL https://alcohol.stackexchange.com but in SEDE it uses beer for the main site and beerme for meta.
As far as I can tell, the shortcut site:// and the magic ...
3
votes
1
answer
143
views
How can one see how many regular users cast at least one reopen vote during the current year?
I am looking at some question reopening statistics. How can one see how many regular users cast at least one reopen vote during the current year? Preferably with the count of how many reopen votes ...
1
vote
1
answer
143
views
SEDE query to pull votes on a post within 90 days of its creation date
I am trying to query SEDE so I can count the upvotes (and I would also like to count the downvotes in its own column) that happen within 90 days after the question or answer was created.
I have tried ...
10
votes
0
answers
158
views
Some Users have an AccountId being Zero
We know account creation is finicky from answers to questions like Why is my profile messed up on a single site and how can I revert it? among others.
In some of these cases you end-up with a user-...
1
vote
1
answer
77
views
How to query SEDE for user sign-up dates for a specific community
We are looking to query a list of users to receive the sign-up dates to the Stack Overflow sub-community. Does anyone have a suggestion how to do this via SEDE?
9
votes
0
answers
113
views
How can we check how often questions sent to the close queue age out?
My site is dealing with a rather long close vote review queue and we believe reviews are regularly ageing out. We want to address the problem in Meta and motivate it with stats over time about how ...
2
votes
0
answers
112
views
Should Windows Phone site be removed from SEDE?
I have noticed that it is possible to see Windows Phone as one of the sites where one can run SEDE queries. As an example, here is a query listing the recent posts.
This query shows the date as the ...
2
votes
0
answers
127
views
The visual human test sometimes appears very far away from the captcha checkbox in SEDE
The visual human test sometimes appears very far away from the captcha checkbox:
(click for full size)
Tested on an iPhone with Chrome. This is inconvenient because one has to unzoom to find it. It’...
4
votes
1
answer
765
views
How can I view unread messages?
At one point I had up to a hundred of unread messages. I read many of them, but the notification icon says there are four I have not. Because of the sheer number of the messages, I am having ...
8
votes
1
answer
96
views
Can you save an edit to a SEDE query's title or description without editing the query?
I don't create queries in SEDE much but when I do, I forget to give them a useful title and description until after I'm finished creating them and click the permalink button.
In an effort to avoid ...
5
votes
2
answers
249
views
Search for comments on posts with a certain tag based on a keyword
I am trying to get the comments on posts with a specific tag which include a phrase or a keyword of interest using SEDE. Here's my first attempt: https://data.stackexchange.com/stackoverflow/query/...
-8
votes
1
answer
119
views
Finding my upvotes on Data Explorer
This T-SQL finds all the questions I have answered on SO.
https://data.stackexchange.com/stackoverflow/query/1779459/questions-i-have-answered
I want to determine if I upvoted the question or not (as ...
9
votes
1
answer
215
views
SQL query used to calculate staff escalation statistics doesn't take into account different tag names for international sites
This is a bug report of a SQL query used at What posts should be escalated to staff using [status-review], and how do I escalate them?.
In the above post, status-review posts are counted using this ...
-9
votes
1
answer
118
views
Add option to download a message in SEDE
In SEDE, one can download the results as a CSV file, but there are no options to download a message.
Feature request: Add an option to download a message in SEDE. That'd be useful for queries ...
4
votes
1
answer
60
views
Can I see other user's badge progress and/or tag progress short of doing SEDE query?
I can see my own badge/tag progress in the "Select your next badge" dialog box from the User Activity screen. For example, I can see that I'm 455/500 for a gold Copy Editor badge in ...
3
votes
1
answer
121
views
Discrepancy in Post Counts between Local MySQL Stack Overflow Database and Data Stack Exchange Query
I recently downloaded the Stack Overflow data from Stack Overflow datadump and imported it into my local MySQL database. While attempting to retrieve the total number of posts for the year 2021 using ...
1
vote
0
answers
95
views
What am I supposed to do when I get what is probably the "requires external JavaScript" error for the chat domains and SEDE?
I'm suddenly getting this error in my main sites (like SO and MSE)
Meta Stack Exchange requires external JavaScript from another domain, which is blocked or failed to load. Retry using another source....
63
votes
11
answers
4k
views
The company's commitment to the data dumps, the API, and SEDE
Many words have been written around the company's commitment to the ongoing existence of the data dumps, the API, and the Stack Exchange Data Explorer (SEDE). Much of that text can be confusing or ...
9
votes
2
answers
278
views
What exactly do VoteTypeId=4 (offensive) and VoteTypeId=12 (spam) mean?
According to the Database schema documentation for SEDE, the table Votes contains as VoteTypeId 4 = Offensive and 12 = Spam. The latter was mentioned, for example, here: How to find posts closed as ...
3
votes
1
answer
523
views
How can I download a small data set (1 year 2023-2022) from Stack Exchange?
I want to download a small dataset for one year. I didn't find a link that allows me to download 1 year of data or the last five years' data.
I tried "https://data.stackexchange.com/stackoverflow/...
2
votes
2
answers
196
views
Does Stack Exchange offer any incentives related to use of their data explorer in research?
I'm interested in whether or not there have historically been this sort of partnership with research in the academic space or if it is planned for the future. This could be something like a grant, ...
5
votes
1
answer
187
views
Ranking users with most edits and close votes on a specific tag
A while back I asked this question about writing a query which would rank users based on their number of close votes on posts related to a specific tag: Table of users ranked based on their number of ...
1
vote
0
answers
40
views
View "people reached" data/trend through time for user
Is there a way to view a trend (graphically or tabulated) in the number of "people reached" per user per site?
E.g., to see how many people a user's posts impacted monthly or annually, etc.
...
5
votes
2
answers
167
views
Table of users ranked based on their number of close votes in a specific tag
I am trying to get the list of users that have the most number of close votes (tallying each type of close votes; i.e. Duplicates, Too Broad, Need MRE, Not About Programming, etc.) in a specific tag. ...
5
votes
0
answers
40
views
Add a bit column for whether a badge is retired in SEDE or a RetiredDate column
It would be nice (for example) to be able to easily filter out badges that are retired when querying SEDE for analysis purposes.
A bit for it would work, or alternatively, a nullable RetiredDate.
Or ...
3
votes
1
answer
74
views
Why does this SEDE query think that the MathOverflow Meta is mathoverflow.meta.stackexchange.com?
I ran a query created by @peterh to find all my network posts. It works perfectly for the task, but it believes that all my posts made to MathOverflow Meta were actually made to https://mathoverflow....