112

Recently, while searching for some technical stuff using Google, a peculiar link came up with the other search results. I tried to visit the site, but it looks like an exact clone of http://stackoverflow.com

What is that? What is the link between SO and http://heima588.com?

23
  • 42
    No link other than that that site is violating the copyright and trademark of Stack Overflow. Commented Aug 4, 2014 at 11:51
  • 8
    The site is also badly broken. Commented Aug 4, 2014 at 11:52
  • 2
    @MartijnPieters It is an exact copy, So what would be the next course of action from our side. Commented Aug 4, 2014 at 11:53
  • 34
    Report the site to the team: Updated procedure for reporting SCRAPERs Commented Aug 4, 2014 at 11:53
  • 3
    @MartijnPieters Reported the details to the team. :) Commented Aug 4, 2014 at 12:07
  • 40
    It would be ironic if they had a meta.heima588.com too :)
    – user3717756
    Commented Aug 4, 2014 at 12:16
  • 34
    I'm following a more direct path than usual in making sure someone at the executive level sees this. Holy pixel-by-pixel copies, Batman!
    – user50049
    Commented Aug 4, 2014 at 12:37
  • 9
    @SufiDeveloper: did you actually try it? (warning: NSFW content) :p Commented Aug 4, 2014 at 12:39
  • 2
    "Imitation is the sincerest of flattery" - Charles Caleb Colton. Though, I wonder if he would feel the same way about cloning. Commented Aug 4, 2014 at 15:59
  • 7
    @ComFreek Only the homepage is. Add /questions/ and … oh now it's also down. Via Google, most cloned questions have a PHP warning about full disk on VPS. SO won by KO :)
    – FelipeAls
    Commented Aug 4, 2014 at 16:37
  • 1
    Why did it get down?
    – frogatto
    Commented Aug 4, 2014 at 17:04
  • 1
    @Qantas94Heavy Hahaha tried it now and it looks very original :D
    – user3717756
    Commented Aug 4, 2014 at 19:00
  • 4
    @gordonlinoff I think there is a real risk that passwords would be compromised yes.
    – Floris
    Commented Aug 5, 2014 at 22:41
  • 6
    maybe we can redirect help vampires to that site
    – gitsitgo
    Commented Aug 6, 2014 at 19:36
  • 3
    By the way, this heima site now appears to be a chinese shopping site.
    – gitsitgo
    Commented Jun 2, 2015 at 13:52

6 Answers 6

78

There is no link between Stack Overflow and that site, other than that, the site does a bad job of copying all of Stack Overflow.

The site is violating Stack Exchange copyright and trademark by copying the design, logo and the contents, see the footer where it says:

site design / logo © 2014 stack exchange inc

Various parts of the site are badly broken too.

Feel free to report the site to the Stack Exchange, using the A site (or scraper) is copying content from Stack Exchange. What do I do?.

4
  • I still don't understand something ... how do they exact copy the entire site. is it kind a saved/cached pages? bit confused.
    – Rahul
    Commented Aug 4, 2014 at 20:07
  • 1
    @Rahul: in this case they are copying the HTML, leaving intact the stylesheets and image references. As a result, your browser styles the HTML exactly the same way Stack Overflow is styled. Commented Aug 4, 2014 at 20:34
  • 6
    LOL .. then they have put enough effort to do this sort of smuggling :). BTW, the site is total down now.
    – Rahul
    Commented Aug 4, 2014 at 20:37
  • 3
    A link from the SO homepage to this question sends enough traffic to bring the site down. :) Commented Aug 6, 2014 at 19:18
123

This doesn't appear to be a scraper, but a proxy of some kind, which appears to run about 10 minutes behind the main site. As it appears that the site is hosted somewhere in China, this could be a way someone has devised in order to make Stack Overflow accessible. I don't know - we're looking into it some more.

I would not attempt to log into that site to poke and explore it, for obvious reasons.

16
  • 7
    Me neither. I don't trust these hacky-wacky sites. Might be full of trojans/malwares. Commented Aug 4, 2014 at 12:50
  • 12
    The sign up and log in links in the top bar are linking to https://stackoverflow.com/, perhaps because the proxy replaces all instances of http://stackoverflow.com but not https links.
    – user247702
    Commented Aug 4, 2014 at 12:57
  • 1
    If the proxy runs 10 minutes behind then it is still a scraper as it must have scraped a copy 10 minutes earlier to store it. :-P I know, pendantry will not get me anywhere.. Commented Aug 4, 2014 at 15:41
  • 5
    @MartijnPieters There's a line between caching (as a typical squid proxy does) and scraping (saving a copy as a copy, not something that expires).
    – user50049
    Commented Aug 4, 2014 at 15:42
  • Sure, I am not being serious here. Glad this is taken seriously, btw. Commented Aug 4, 2014 at 15:45
  • 43
    Ah, America and China, the extreme opposite ends of the Copyright spectrum. Commented Aug 4, 2014 at 15:53
  • 3
    @TimPost You say "a way to make Stack Overflow accessible", do you mean that the real SO is normally blocked in China?
    – Mr Lister
    Commented Aug 5, 2014 at 5:07
  • 9
    What's that you say? 10 minutes in the past? A new frontier of time-travel-enabled fastest-gun-in-the-west lies before us. Commented Aug 5, 2014 at 5:38
  • 3
    @MrLister SO in China was having problems with the JavaScript in the recent past due to the Google blockage (but was still semi-usable), but everything has been a-ok for a while now.
    – Xiaofu
    Commented Aug 5, 2014 at 5:46
  • If this is truly something nefarious and/or commercial AND is being hosted in the PRC then you could report them to the relevant authorities (can't tell you who exactly). Sites hosted in China must be registered and display their corresponding ICP number at the bottom of the homepage. E.g. see Baidu for an example: ICP证030173号. Since they don't have this they can potentially be shut down. And if they did have this then you could track them down...
    – Xiaofu
    Commented Aug 5, 2014 at 5:55
  • 2
    Funny, we were just talking about accessing SO from China.
    – user456814
    Commented Aug 5, 2014 at 6:06
  • 1
    It's not even 10 minutes behind the main site right now, maybe 2-3 minutes max. I had 3 upvotes on stackoverflow.com/questions/25146969/… 5 minutes ago, now I have 5, and heima588.com/questions/25146969/… reflects this.
    – AStopher
    Commented Aug 5, 2014 at 20:07
  • 3
    @zyboxinternational It's definitely a proxy - just one that has a hard time keeping up (and well, considering our volume, that's sort of expected). Devs are looking into it.
    – user50049
    Commented Aug 6, 2014 at 16:42
  • @TimPost As I mentioned in my answer, it's probably a low-end VPS that's hosting the site. It would make a LOT of sense in this instance...
    – AStopher
    Commented Aug 6, 2014 at 17:41
  • @TimPost meta.stackoverflow.com/a/267976/2518525
    – Darren
    Commented Aug 7, 2014 at 6:50
22

It is still up by the way. I know this should be a comment and not an answer, but I don't have the necessary reputation to comment.

http://heima588.com/questions/24969251/how-to-display-pdf-file-contents-as-well-as-its-full-name-in-the-browser-using-c

Link. The home page is down, so is /questions, but /answers lets me access the site, most of which redirects to the legitimate stackoverflow.com

10
  • 21
    I must say, leaving a comment as an answer on MSO is so ironic I almost have to upvote it. I won't bother with the semi-rant that tells you to be patient until you have enough rep to comment...so, I suppose I will... Commented Aug 5, 2014 at 5:54
  • 3
    I figured it was important to tell people that the website is still up.
    – Bretsky
    Commented Aug 5, 2014 at 5:56
  • No it's not. From yesterday night I tried almost 10 times and can't access. I think that site is not accessible from India.
    – Rahul
    Commented Aug 5, 2014 at 9:46
  • It is accessible as of right now, in Canada, but only some parts of it work.
    – Bretsky
    Commented Aug 5, 2014 at 19:51
  • It's also not only still accessible from that link in the US, but the homepage works for me as well
    – Izkata
    Commented Aug 5, 2014 at 20:06
  • 7
    Why is it important to frequently report if another site is up or down on MSO? Commented Aug 5, 2014 at 20:12
  • This proxy site obviously has limited bandwidth and since this thread was posted, it's been going down quite frequently.
    – AStopher
    Commented Aug 5, 2014 at 20:17
  • @martin-capodici Because the site is a complete copy of SE.
    – Bretsky
    Commented Aug 5, 2014 at 20:19
  • @Bretsky It's not an exact copy, it's a proxy that seems to be caching each page once every 10 minutes-or-so. I'm timing how often it updates (by picking up a comment I made on a question). Once done, I'll post an answer to this..
    – AStopher
    Commented Aug 5, 2014 at 20:21
  • There. You now have enough reputation to comment. Commented Dec 19, 2014 at 3:01
14

I personally find this sad. It could be a legitimate reason like the host country has blocked Stack Overflow and this site is acting as a proxy, but I've done some digging.

When running a ping on this site:

ping heima588.com

The returned IP is: 103.6.84.70

Reverse IP

A reverse IP lookup on heima588.com warrants the following domains (Source)

32ie.com            www.byby-sh5.com
heima588.com        www.diandaomedia.com
longhack.com        www.heima588.com
www.bjxdqcyp.com    www.tumourhealth.com

None of the domains hosted on the same IP above seem to bear any resemblance to the trouble domain (heima588.com). So I ruled them out of the equation for now.

The originating IP address resolves to Honk Kong, China. (Source)

Doing some more snooping, and looking at the DNS Name servers, it seems that the IP & subsiquently the sites, are hosted through Mongit / Host Virtual (Host Virtual seems to host VPS installations in Hong Kong, China (Source)).

ASN Lookup

The ASN lookup warrants the same information, it is owned by Host Virtual, or should I say it is hosted by them. (Source #1, Source #2)

And looking at the IP Block associated, we see that our returned IP is in fact hosted via Host Virtual through "China Mobile". (Source)

Conclusion

I might have found the culprit/guilty party at fault here. I've stumbled upon a person that talks about, and I quote:

As Chinese government banned many foreign websites like youtube, blogger, facebook and so forth, i feel the crisis of human rights in China!

I'm not going to post/publish the name/information on here, but if a moderator would like to contact me to verify/get this info to see if it is indeed so, please do if you haven't already found out who the party at fault here is. Just trying to help out here.

6
  • 2
    Looks like a person with a good motive using the wrong means. Commented Aug 7, 2014 at 6:51
  • 1
    @InfiniteRecursion Exactly what I was thinking. This person doesn't seem the slightest bit malicious from what I can find, but you can never judge a book by its cover.
    – Darren
    Commented Aug 7, 2014 at 6:51
  • 2
    I don't understand why there would be such a pressing need to provide a proxy for Stack Overflow. News and social media sites, sure, but is access to Stack Overflow really such a pressing human rights need?
    – user456814
    Commented Aug 7, 2014 at 6:53
  • @Cupcake Take it the chinese government doesn't want its people to know how to code properly or solve any issues.
    – Darren
    Commented Aug 7, 2014 at 6:59
  • 1
    Man!! that's a nice investigation Eh!!
    – Rahul
    Commented Aug 7, 2014 at 12:25
  • 3
    Because China breaks HTTPS, @Cupcake. Sad but true.
    – Shog9
    Commented Aug 7, 2014 at 23:51
6

This is the response to the report filed by me, from the Stack Exchange team,

Thank you for reporting this content. I've passed the information along to the person at our company who handles such issues. It's the diligence of users like you that helps us stay valuable!

Please note, bringing these sites into compliance (or getting them to no longer serve our content) is often a long and arduous process. You may not see immediate results. However, rest assured that we're working on it.

Thank you again, Stack Exchange Team

So from this we can understand that the target site has been considered under action. and we cannot expect any sudden changes regarding the shutting down process of that site.

4

I commented on a question, bought up the same question on heima588.com and waited. The comment hasn't turned up on the site yet, and it's been 25 minutes.

I browsed the site, and quite a few times I got a message that there's no space left on the host, which appears to be a low-end VPS (judging by the error message and abysmal performance of the site):

enter image description here

It also appears to have hardly any resources available to it since the above error isn't uncommon and sometimes the site ceases to load at all.

As Tim Post mentioned, it's probably a badly-designed (unauthorised) proxy that somebody has set-up to get around a block in their country or ISP, most likely to do with the issues with StackExchange using Google-hosted JavaScript code when China blocked access to Google.

The domain's DNS is with DNSPod which looks similar to CloudFlare.

All in all, it's not a good idea to go logging into the site, since it could potentially get your username/password.

3
  • 57
    Damn, that's one huge taskbar you have there
    – SeinopSys
    Commented Aug 6, 2014 at 15:41
  • hostcabi.net/domain/heima588.com simple data regarding the hosting and whois.domaintools.com/heima588.com for the domain info. Site appears to be blank for me now.
    – Patrick
    Commented Aug 6, 2014 at 15:51
  • @DJDavid98 It's easier to switch between programs, especially when programming (I usually have many windows open, they go onto at least two rows on the taskbar).
    – AStopher
    Commented Aug 6, 2014 at 16:13

Not the answer you're looking for? Browse other questions tagged .