tfrecord type looks like txt or image











up vote
0
down vote

favorite












I'm generating tfrecords of BDD dataset from a 20000 subset. While creating tfrecords I choose 1000 images per tfrecord, and everything is fine.



However if I choose 500 images per tfrecord, some of them (very rare) have type TGA image (image/x-tga) or MATLAB script/function (text/x-matlab) while normally they should have had Program (application/octet-stream) or Binary (application/octet-stream).



Why would this happen, and does it mean the tfrecords are broken?



feature = self._get_tf_feature(
picture_id, os.path.join(full_images_path, f),
m.group(2), picture_id_annotations, new_format)
example = tf.train.Example(features=feature)
writer.write(example.SerializeToString())









share|improve this question
























  • From the code snippet above it's not totally clear how the features are generated. But probably this problem has nothing to do with tfrecord format, because this format just stores the data. That means, that the data, which you provide while generating tfrecords are "wrong" - incorrect image type. Maybe it's worthy to check source images itself.
    – Vlad-HC
    11 hours ago










  • I was thinking the same, but I have a flag for elements per tfrecord, and I only change the number without changing the the way I get the features. Since the records are fine with 1000 samples each, I don't think the problem is caused by the features..
    – kneazle
    11 hours ago















up vote
0
down vote

favorite












I'm generating tfrecords of BDD dataset from a 20000 subset. While creating tfrecords I choose 1000 images per tfrecord, and everything is fine.



However if I choose 500 images per tfrecord, some of them (very rare) have type TGA image (image/x-tga) or MATLAB script/function (text/x-matlab) while normally they should have had Program (application/octet-stream) or Binary (application/octet-stream).



Why would this happen, and does it mean the tfrecords are broken?



feature = self._get_tf_feature(
picture_id, os.path.join(full_images_path, f),
m.group(2), picture_id_annotations, new_format)
example = tf.train.Example(features=feature)
writer.write(example.SerializeToString())









share|improve this question
























  • From the code snippet above it's not totally clear how the features are generated. But probably this problem has nothing to do with tfrecord format, because this format just stores the data. That means, that the data, which you provide while generating tfrecords are "wrong" - incorrect image type. Maybe it's worthy to check source images itself.
    – Vlad-HC
    11 hours ago










  • I was thinking the same, but I have a flag for elements per tfrecord, and I only change the number without changing the the way I get the features. Since the records are fine with 1000 samples each, I don't think the problem is caused by the features..
    – kneazle
    11 hours ago













up vote
0
down vote

favorite









up vote
0
down vote

favorite











I'm generating tfrecords of BDD dataset from a 20000 subset. While creating tfrecords I choose 1000 images per tfrecord, and everything is fine.



However if I choose 500 images per tfrecord, some of them (very rare) have type TGA image (image/x-tga) or MATLAB script/function (text/x-matlab) while normally they should have had Program (application/octet-stream) or Binary (application/octet-stream).



Why would this happen, and does it mean the tfrecords are broken?



feature = self._get_tf_feature(
picture_id, os.path.join(full_images_path, f),
m.group(2), picture_id_annotations, new_format)
example = tf.train.Example(features=feature)
writer.write(example.SerializeToString())









share|improve this question















I'm generating tfrecords of BDD dataset from a 20000 subset. While creating tfrecords I choose 1000 images per tfrecord, and everything is fine.



However if I choose 500 images per tfrecord, some of them (very rare) have type TGA image (image/x-tga) or MATLAB script/function (text/x-matlab) while normally they should have had Program (application/octet-stream) or Binary (application/octet-stream).



Why would this happen, and does it mean the tfrecords are broken?



feature = self._get_tf_feature(
picture_id, os.path.join(full_images_path, f),
m.group(2), picture_id_annotations, new_format)
example = tf.train.Example(features=feature)
writer.write(example.SerializeToString())






python-3.x tensorflow tfrecord






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited 11 hours ago









Vlad-HC

627711




627711










asked 11 hours ago









kneazle

166




166












  • From the code snippet above it's not totally clear how the features are generated. But probably this problem has nothing to do with tfrecord format, because this format just stores the data. That means, that the data, which you provide while generating tfrecords are "wrong" - incorrect image type. Maybe it's worthy to check source images itself.
    – Vlad-HC
    11 hours ago










  • I was thinking the same, but I have a flag for elements per tfrecord, and I only change the number without changing the the way I get the features. Since the records are fine with 1000 samples each, I don't think the problem is caused by the features..
    – kneazle
    11 hours ago


















  • From the code snippet above it's not totally clear how the features are generated. But probably this problem has nothing to do with tfrecord format, because this format just stores the data. That means, that the data, which you provide while generating tfrecords are "wrong" - incorrect image type. Maybe it's worthy to check source images itself.
    – Vlad-HC
    11 hours ago










  • I was thinking the same, but I have a flag for elements per tfrecord, and I only change the number without changing the the way I get the features. Since the records are fine with 1000 samples each, I don't think the problem is caused by the features..
    – kneazle
    11 hours ago
















From the code snippet above it's not totally clear how the features are generated. But probably this problem has nothing to do with tfrecord format, because this format just stores the data. That means, that the data, which you provide while generating tfrecords are "wrong" - incorrect image type. Maybe it's worthy to check source images itself.
– Vlad-HC
11 hours ago




From the code snippet above it's not totally clear how the features are generated. But probably this problem has nothing to do with tfrecord format, because this format just stores the data. That means, that the data, which you provide while generating tfrecords are "wrong" - incorrect image type. Maybe it's worthy to check source images itself.
– Vlad-HC
11 hours ago












I was thinking the same, but I have a flag for elements per tfrecord, and I only change the number without changing the the way I get the features. Since the records are fine with 1000 samples each, I don't think the problem is caused by the features..
– kneazle
11 hours ago




I was thinking the same, but I have a flag for elements per tfrecord, and I only change the number without changing the the way I get the features. Since the records are fine with 1000 samples each, I don't think the problem is caused by the features..
– kneazle
11 hours ago

















active

oldest

votes











Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














 

draft saved


draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53416135%2ftfrecord-type-looks-like-txt-or-image%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown






























active

oldest

votes













active

oldest

votes









active

oldest

votes






active

oldest

votes
















 

draft saved


draft discarded



















































 


draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53416135%2ftfrecord-type-looks-like-txt-or-image%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

What visual should I use to simply compare current year value vs last year in Power BI desktop

How to ignore python UserWarning in pytest?

Alexandru Averescu